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Chapter 1 

Introduction to atomic physics 



1.1 The importance of atomic physics 

Atomic Physics is the subject that studies the inner workings of the atom. It remains one of the most 

important testing grounds for quantum theory, and is therefore still an area of active research, both for 
its contribution to fundamental physics and to technology. Furthermore, many other branches of science 
rely heavily on atomic physics. The following list gives a few examples: 

Physics Astrophysics, plasma physics, atmospheric physics, solid state physics, chemical physics and 
radiation physics. 

Other science Chemistry (analysis, reaction rates), biology (molecular structure, physiology), materials 

science, energy research, fusion studies. 

Applications Lasers, X-ray technology, NMR, pollution detection, medical applications of devices 
(lasers, NMR etc.). 

In this chapter we shall give a brief overview of the main effects that will be considered in this course, 
and define a few important terms and concepts that underlie the whole subject. We shall also give a 
brief review of the Bohr theory, which was the basis of the "old" (i.e. pre-quantum mechanics) quantum 
theory of the atom. 

1.2 Atomic spectroscopy 

We gain most of our knowledge of atoms from studying the way light interacts with matter, and in 
particular from measuring atomic spectra. Optics has therefore played a key role in the development 
of atomic physics. The extreme precision with which optical spectral lines can be measured makes 
atomic physics the most precise branch of physics. For example, the frequencies of the spectral lines of 
hydrogen have been measured with extremely high accuracy, permitting the testing of small but important 
phenomena that are normally unobservable. 

The basis for atomic spectroscopy is the measurement of the energy of the photon absorbed or emitted 
when an electron jumps between two quantum states, as shown in Fig. 1.1. These are called radiative 
transitions. The frequency (ly) of the photon (and hence its wavelength. A) is determined by the 
difference in energy of the two levels according to: 

he 

hiy= — =E2-E,, (1.1) 

where Ei and E2 are the energies of the lower and upper levels of the atom respectively. 

Spectroscopists measure the wavelength of the photon, and hence deduce energy differences. The 
absolute energies are determined by fixing one of the levels (normally the ground state) by other methods, 
e.g. by measurement of the ionization potential. 

1.3 Energy units 

Atomic energies are frequently quoted in electron volts (eV) . 1 eV is the energy acquired by an electron 
when it is accelerated by a voltage of 1 Volt. Thus 1 eV = 1.6 x 10~^^ J. This is a convenient unit, 
because the energies of the electrons in atoms are typically a few eV. 
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Figure 1.1: Absorption and emission transitions. 



Energy scale 


Energy (eV) 


Energy (cm ^) 


Contributing effects 


Gross structure 


1 - 10 


10'^ - 10^ 


electron-nuclear attraction 
electron-electron repulsion 
electron kinetic energy 


Fine structure 


0.001 - 0.01 


10 - 100 


spin-orbit interaction 
relativistic corrections 


Hyperfine structure 


10-^ - 10-^ 


0.01 - 0.1 


nuclear interactions 



Table 1.1: Rough energy scales for the different interactions that occur within atoms. 

Atomic energies are also often expressed in wave number units (cm^^). The wave number V is the 
reciprocal of the wavelength of the photon with energy E. It is defined as follows: 



1 



E 



A (in cm) c he 



(1.2) 



Note that the wavelength should be worked out in cm. Thus leV — (e/hc) cm~^ = 8066 cm~^. 

Wave number units are particularly convenient for atomic spectroscopy. This is because they dis- 
pense with the need to introduce fundamental constants in our calculation of the wavelength. Thus the 
wavelength of the radiation emitted in a transition between two levels is simply given by: 



= V2-~ 1^1, 



(1.3) 



where and 1^2 are the energies of levels 1 and 2 in cm ^ units, and A is measured in cm. 

1.4 Energy scales in atoms 

In atomic physics it is traditional to order the interactions that occur inside the atom into a three-level 
hierarchy according to the scheme summarized in Table 1.1. The effect of this hierarchy on the observed 
atomic spectra is illustrated schematically in Fig. 1.2. 

1.4.1 Gross structure 

The first level of the hierarchy is called the gross structure, and covers the largest interactions within 
the atom, namely: 

• the kinetic energy of the electrons in their orbits around the nucleus; 

• the attractive electrostatic potential between the positive nucleus and the negative electrons; 

• the repulsive electrostatic interaction between the different electrons in a multi-electron atom. 
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Figure 1.2: Hierarchy of spectral lines observed with increasing spectral resolution. 



The size of these interactions gives rise to energies in the 1-10 eV range and upwards. They thus determine 
whether the photon that is emitted is in the infrared, visible, ultraviolet or X-ray spectral regions, and 
more specifically, whether it is violet, blue, green, yellow, orange or red for the case of a visible transition. 



1.4.2 Fine structure 

Close inspection of the spectral lines of atoms reveals that they often come as multiplets. For example, 
the strong yellow line of sodium that is used in street lamps is actually a doublet: there are two lines with 
wavelengths of 589.0 nm and 589.6 nm. This tells us that there are smaller interactions going on inside 
the atom in addition to the gross structure effects. The gross structure interactions determine that the 
emission line is yellow, but fine structure effects cause the splitting into the doublet. In the case of the 
sodium yellow line, the fine structure energy splitting is 2.1 x 10~''eV or 17cm~^. 

Fine structure arises from the spin-orbit interaction. Electrons in orbit around the nucleus are 
equivalent to current loops, which give rise to atomic magnetism. The magnitude of the magnetic dipole 
moment of the electron is typically of the order of the Bohr magneton /xb: 

H^ = -^= 9.27 X 10-24 J . (1.4) 

The atomic dipoles generate strong magnetic fields within the atom, and the spin of the electron can 
then interact with the internal field. This produces small shifts in the energies, which can be worked out 
by measuring the fine structure in the spectra. In this way we can learn about the way the spin and 
the orbital motion of the atom couple together. In more advanced theories of the atom (e.g. the Dirac 
theory), it becomes apparent that the spin-orbit interaction is actually a relativistic effect. 



1.4.3 Hyperfine structure 

Even closer inspection of the spectral lines with a very high resolution spectrometer reveals that the 
fine-structure lines are themselves split into more multiplets. The interactions that cause these splitting 
are called hyperfine interactions. 

The hyperfine interactions are caused by the interactions between the electrons and the nucleus. The 
nucleus has a small magnetic moment of magnitude ^ /xb/2000 due to the nuclear spin. This can interact 
with the magnetic field due to the orbital motion of the electron just as in spin-orbit coupling. This gives 
rise to shifts in the atomic energies that are about 2000 times smaller than the fine structure shifts. The 
well-known 21 cm line of radio astronomy is caused by transitions between the hyperfine levels of atomic 
hydrogen. The photon energy in this case is 6 x 10^^ eV, or 0.05 cm^^. 



1.5 External fields 

Further information about the inner workings of atoms can be obtained by studying the effects of external 
magnetic and electric fields. (See Table 1.2.) These effects will be considered in detail in Chapter 7. 

The splitting of atomic spectral lines when a weak external magnetic field is applied is called the 
Zeeman eflfect. When the field strength is very large, a different type of effect is observed called the 
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Applied field 


Field strength 


Effect 


Magnetic 


weak 


Zeeman 




strong 


Paschen-Back 


Electric 


all 


Stark 



Table 1.2: Names of the effects of external fields in atomic physics. 




Figure 1.3: The Bohr model of the atom considers the electrons to be in orbit around the nucleus. The 
central force is provided by the Coulomb attraction. The angular momentum of the electron is quantized 
in integer units of h. 

Paschen-Back effect. However, for most atoms, the size of the field required to observe the transition 
from the Zeeman effect to the Paschen-Back effect is beyond the range of laboratory magnets, and so 
only the Zeeman effect is commonly observed. 

In the case of electric fields, the weak and strong field limits are not normally distinguished, and all 
the phenomena are collectively called the Stark effect. These effects are named after J. Stark, who 
was the first person to study the effect of electric fields on atomic spectral lines, when he measured the 
splitting of the hydrogen Balmer lines in an electric field in 1913. 

1.6 The Bohr model 

In 1911 Rutherford discovered the nucleus. This then led to the idea of atoms consisting of electrons 
in classical orbits in which the central forces are provided by the Coulomb attraction to the positive 
nucleus, as shown in Fig. 1.3. The problem with this idea is that the electron in the orbit is constantly 
accelerating. Accelerating charges emit radiation called bremsstrahlung, and so the electrons should 
be radiating all the time. This would reduce the energy of the electron, and so it would gradually spiral 
into the nucleus, like an old satellite crashing to the earth. 

In 1913 Bohr produced his model for the atom. The key new elements of the model are: 

• The angular momentum L of the electron is quantized in units oi h [h = h/2'K): 

L^nh, (1.5) 

where n is an integer. 

• The atomic orbits are stable, and light is only emitted or absorbed when the electron jumps from 
one orbit to another. 

When Bohr made these hypotheses in 1913, there was no justification for them other than they were 
spectacularly successful in predicting the energy spectrum of hydrogen. With the hindsight of quantum 
mechanics, we now know why they work. The first assumption is equivalent to stating that the orbit 
must correspond to a fixed number of de Broglie wavelengths. For a circular orbit, this can be written: 

h h 

27rr = mtegcr x XdcB = n x — = n x , (1-6) 

p mv 
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which can be rearranged to give 

L = mvr = nx — . (1.7) 

The second assumpt ion is a consequence of the fact that the Schrodinger equation leads to time-independent 
solutions (eigenstates). 

The derivation of the quantized energy levels proceeds as follows. Consider an electron orbiting a 
nucleus of mass mN and charge +Ze. The central force is provided by the Coulomb force: 



As with all two-body orbit systems, the mass m that enters here is the reduced mass: 



(1.8) 



The energy is given by:^ 



111 , , 

- = — + . 1.9 

m nie rriN 



En = kinetic energy -|- potential energy 

1 2 Ze^ 
= ^mv 



47reor 



(1.10) 



where we made use of eqns 1.7 and 1.8 in the last line. This can be written in the form: 



En = -% (1.11) 



where R' is given by: 



and Roohc is the Rydberg energy:^ 



R' = [—Z^]Roohc, (1.12) 

The Rydberg energy is a fundamental constant and has a value of 2.17987 x 10""'^* J, which is equivalent 
to 13.606 eV. This tells us that the gross energy of the atomic states in hydrogen is of order 1-10 eV, 
or 10^ — 10^ cm~^ in wave number units. 

R' is the effective Rydberg energy for the system in question. In the hydrogen atom we have an 
electron orbiting around a proton of mass irip. The reduced mass is therefore given by 

m = meX — = 0.9995 me 

rrie + nip 

and the effective Rydberg energy for hydrogen is: 

i?H = 0.9995 i?oo/ic. (1.14) 

Atomic spectroscopy is very precise, and 0.05% factors such as this are easily measurable. Furthermore, 
in other systems such as positronium (an electron orbiting around a positron), the reduced mass effect is 
much larger, because m = me/2. 

By following through the mathematics, we also find that the orbital radius and velocity are quantized. 
The relevant results are: 

rn = ^ — ao , (1-15) 
Z m, 

^In atoms the electron moves in free space, where the relative dielectric constant €r is equal to unity. However, in solid 
state physics we frequently encounter hydrogenic systems inside crystals where er is not equal to 1. In this case, we must 
replace eo by e^eo throughout. 

Note the diflference between the Rydberg energy iloo/Jc (13.606 eV) and the Rydberg constant iloo (109,737 cm"!). The 
former has the dimensions of energy, while the latter has the dimensions of inverse length. They differ by a factor of he. 
(See Table 1.3.) When high precision is not required, it is convenient just to use the symbol Rn for the Rydberg energy, 
although, strictly speaking, iin differs from the true Rydberg energy by 0.05%. (See eqn 1.14.) 
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Quantity 


Symbol 


Formula 


Numerical Value 


Rydberg energy 


Roohc 


meeV8eg/i2 


2.17987 X 10-18 J 








13.6057 eV 


Rydberg constant 


Roo 




109,737 cm-i 


Bohr radius 


ao 




5.29177 X 10"" m 


Fine structure constant 


a 




1/137.04 



Table 1.3: Fundamental constants that arise from the Bohr model of the atom. 



and 

Vn = a— c . (1-16) 

n 

The two fundamental constants that appear here are the Bohr radius oq: 

ao = ^, (1.17) 

and the fine structure constant a: 

The fundamental constants arising from the Bohr model are related to each other according to: 

ao = — -, (1.19) 
nieC a 



and 



The definitions and values of these quantities are given in Table 1.3. 

The energies of the photons emitted in transition between the quantized levels of hydrogen can be 
deduced from eqn 1.11: 

where ni and n2 are the quantum numbers of the two states involved. Since u = c/X, this can also be 
written in form: 

A me^°° (n^ nl) ' (1-22) 

In absorption we start from the ground state, so we put ni = I. In emission, we can have any combination 
where ni < n2. Some of the series of spectral lines have been given special names. The emission lines 
with ni = 1 are called the Lyman series, those with ni ~ 2 arc called the Balmer series, etc. The 
Lyman and Balmer lines occur in the ultraviolet and visible spectral regions respectively. 



1.7 The need for quantum mechanics 

A simple back-of-the-envelope calculation can easily show us that the Bohr model is not fully consistent 
with quantum mechanics. In the Bohr model, the linear momentum of the electron is given by: 

(aZ\ nh ,^ 
1 mc = — . (1-23) 

However, we know from the Heisenberg uncertainty principle that the precise value of the momentum 
must be uncertain. If we say that the uncertainty in the position of the electron is about equal to the 
radius of the orbit r„, we find: 
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Figure 1.4: The Stern-Gerlach experiment. A beam of monovalent atoms with L ~ (i.e. zero orbital 
angular momentum and hence zero orbital magnetic dipole moment) is deflected in two discrete ways by 
a non-uniform magnetic field. The force on the atoms arises from the interaction between the field and 
the magnetic moment due to the electron spin. 



On comparing Eqs. 1.23 and 1.24 we see that 

\p\KnAp. (1.25) 

This shows us that the magnitude of p is undefined except when n is large. This is hardly surprising, 
because the Bohr model is a mixture of classical and quantum models, and we can only expect the 
arguments to be fully self-consistent when we approach the classical limit at large n. For small values of 
n, the Bohr model fails when we take the full quantum nature of the electron into account. 



1.8 Spin 

We shall discuss the spin of the electron and its consequences in detail from Chapter 6 onwards. At this 
stage we just need to know the basic facts. Spin is a completely quantum mechanical property with no 
classical equivalent. The Stern-Gerlach experiment (Fig. 1.4) showed that atoms with no orbital angular 
momentum still possess angular momentum.'^ For want of a better name, this angular momentum was 
called "spin" . Paul Dirac at Cambridge successfully accounted for electron spin when he produced the 
relativistic wave equation that bears his name in 1928. 

The fact that there were two deflections in the Stern-Gerlach experiment led to the conclusion that 
the ^-component of the spin can take two possible values. We therefore assign two spin quantum numbers 
to the electron, namely s and ttIj, where s — 1/2 and rus — ±1/2. The magnitude of the spin angular 
momentum is given by 

S=^/s{3 + l)h, (1.26) 
and the component along the z axis is given by 

S,^ = m,h . (1.27) 

The fact that the electron has a half-integer spin makes it a fermion. Fermions obey the Pauli exclusion 
principle. Particles with integer values of the spin (eg a-particles) do not obey the Pauli exclusion 
principle. The Pauli exclusion principle states that only one electron can occupy a particular quantum 
state. This is very important for the explanation of the periodic table of elements, as we shall see in 
Chapter 4. 



Reading 
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^In the Bohr model, the orbital angular momentum L is equal to nh, and since n > 1, L cannot be zero. However, we 
shall see in the next chapter that L is actually equal to + l)h, where I is the orbital quantum number. Since I can 

take values from to n — 1, some atoms can have L = 0. Moreover, we shall see in Chapter 6 that the magnetic dipole 
moment of an atom is directly proportional to its angular momentum. Hence an atom with L = should not be deflected 
by a non-uniform magnetic field if wc do not include the effects of spin. 
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Chapter 2 

The Hydrogen atom 



In the previous chapter we gave a quick overview of the Bohr model, which is only really valid in the 
semiclassical limit, (cf. section 1.7.) We now begin our task in earnest by applying quantum mechanics 
to the simplest atom we know, namely the hydrogen atom. 

It is well known from classical physics that planetary orbits are characterized by their energy and 
angular momentum. In this chapter we apply the Schrodinger equation to the hydrogen atom to find the 
allowed energies and angular momenta of the nucleus-electron system. In classical systems we are also 
able to calculate^ the precise trajectory of the orbit. This is not possible in quantum systems. The best 
we shall be able to do is to find the wave functions. These will then give us the probability amplitudes 
that allow us to calculate all the measurable properties of the system. 



2.1 The Schrodinger Equation 

The time-independent Schrodinger equation for hydrogen is given by: 



- 

2m 47reor 



*(r,6',(/>) = E ^{r,e, 



(2.1) 



This is written in terms of the spherical polar co-ordinates (r, 9, 4>) because atoms are spheres, and the 
use of spherical polar co-ordinates simplifies the solutions. Note that we are considering the motion of 
the electron relative to a stationary nucleus here. As with all two-body problems, this means that the 
mass that enters into the equation is the reduced mass defined previously in eqn 1.9: 



1 

m 



1 1 

— + 

me mN 



(2.2) 



For hydrogen where mN = Wp, the reduced mass is very close to me, and has a value of 0.9995me. 
Written out explicitly, we have 



'2m 



J,2 Qy 



,9* 
dr 



1 



d 



sin 9 89 



sin9 



'89 



+ 



1 



sin^ ( 



47reor 



* = £■ * 



(2.3) 



Our task is to find the wave functions ^{r,9,(j)) that satisfy this equation, and hence to find the allowed 
quantized energies E. 



2.2 Angular momentum 

In classical mechanics, the rate of change of the angular momentum is given by: 
where T is the torque defined by: 

r = rxF. (2.5) 

In the hydrogen atom, the electron is bound to the nucleus by the Coulomb force, which is parallel to 
r. The torque is therefore zero, and so the angular momentum of the electron does not change. This 
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means that the angular momentum is a constant of the motion, and is therefore very important in 
the specification of the quantum states of the atom.^ 
The classical definition of angular momentum is: 

L = r X p. (2.6) 

For circular orbits this simplifies to L = mvr, and in Bohr's model, L was quantized in integer units of h. 
(See eqn 1.7.) However, the fiill quantum treatment is more complicated, and requires the introduction 
of two other quantum numbers / and m;, as we shall now see. 
The components of L are given by 

/x\ /'px\ fyPz-zPv\ 

(2.7) 

In quantum mechanics we represent the linear momentum by differential operators of the type 

= -ih-^ ■ (2.8) 

Therefore, the quantum mechanical operators for the angular momentum are given by: 

f n f d d \ 




Px\ 


I yPz - 


ZPy 


Py 


= \ ZPx- 


XPz 




\xPy ~~ 


ypx 



i \ dz dy 

dy ^ dx) ^ ^ 

Note that the "hat" symbol indicates that we are representing an operator and not just a number. 
The magnitude of the angular momentum is given by: 

, 2 

We therefore define the quantum mechanical operator L by 

^"=11+11 + 11. (2.12) 
Note that operators like L^. should be understood in terms of repeated operations: 



dz'^ dy dz dydz dy"^ 

It can be shown that the components of the angular momentum operator do not commute, that is 

LyLx . 

In fact we can show that: 

[Lx,Ly]=ihL,, (2.13) 
where the "commutator bracket" [Lx,Ly] is defined by 

[Lxj Ly] = LxLy — LyLx ' (^'l^) 

The other commutators of the angular momentum operators, namely [Ly,Lz\ and [Lj;,!/^:] are obtained 
by cyclic permutation of the indices in Eq. 2.13: x ^ y, y ^ z, z ^ x. 

^The starting approximation for the treatment of multi-electron atoms is the central field approximation in which 
we assume that the dominant force is radial (i.e. pointing centrally towards the nucleus), so that the potential only depends 
on r. (See Section 4.1.) In this case the torque is also zero, so that again the angular momentum is constant. 
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|L| = V/(/+i) n 




= rriih 



Figure 2.1: Vector model of the angular momentum in an atom. The angular momentum is represented 
by a vector of length y^l{l + l)h precessing around the z-axis so that the z-component is equal to rriih. 



This rather esoteric point has deep significance. If two quantum mechanical operators do not commute, 
then it is not possible to know their values simultaneously. Consider, for example, the operators for 
position and momentum in a one-dimensional system: 

r- -1, , /dV'A ..d(x-0) 
[x,p\tp — [xp — px)ip — —ihx —— I + in — -j — inip . 



dx I 



Ax 



(2.15) 



Thus we have: 

= ih 7^ . 

The fact that [x,p\ ^ means that the operators do not commute. This is intrinsically linked to the fact 
that we cannot measure precise values for the position and momentum simultaneously, which we know 
from the Heisenberg uncertainty principle. The argument based on commutators is thus a more formal 
way of understanding uncertainty products. 

In the case of the angular momentum operators, the fact that Lj,, Ly and do not commute means 
that we can only know one of the components of L at any time. If we know the value of L^, we cannot 
know and Ly as well. However, we can know the length of the angular momentum vector, because we 
can show that and L^ commute. In summary: 

• We can know the length of the angular momentum vector L and one of its components. 

• For mathematical convenience, we usually take the component we know to be the z component, ie 

• We cannot know the values of all three components of the angular momentum simultaneously. 

This is represented pictorially in the vector model of the atom shown in figure 2.1. In this model the 
angular momentum is represented as a vector of length + \)h precessing around the z axis so that 
the component along that axis is equal to mih. The x and y components of the angular momentum are 
not known. 

In spherical polar co-ordinates, the two key angular momentum operators are given by: 



and 



1 d 



h d 
i d(p 

d 



1 



sin Ode \""^"dej ' sm^ed(f>^ 

It is easy to see that the Schrodinger equation given in Eq. 2.3 can be re-written as follows: 



1 d 
2m dr 



" 2 

L 



The eigenfunctions of the angular momentum operator are found by solving the equation: 



L''F{0,<j)) = -h^ 



1 d 
shi^M 



sm 



de 



1 



SVC? ' 



F{0, 



CF{e, 



(2.16) 



(2.17) 



(2.18) 



(2.19) 
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For reasons that will become clearer later, the constant C is usually written in the form: 

C = l{l + l)h'^ . (2.20) 
At this stage, I can take any value, real or complex. Wc can separate the variables by writing: 

F{0,^) = &{0)<S>{^). (2.21) 
On substitution into eqn 2.19 and cancelling the common factor of fi^, we find: 

' ' ^-^fU- ^6^1=^(1 + 1)9$. (2.22) 



sin6»d6'V d6» y sin^ 9 dc/)"^ 
Multiply by — sin^ 6/0$ and re-arrange to obtain: 



sin 61 d /. ^d0\ . ,,,,, 1 d^^ 

sin^^T^ +sin2 0^(; + l) = -- — . (2.23) 



O d6» V de J ' ' $ d?!>2 ■ 

The left hand side is a function of 9 only, while the right hand side is a function of (p only. The equation 
must hold for all values of the 6 and (p and hence both sides must be equal to a constant. On writing 
this arbitrary separation constant m^, we then find:^ 

sin 61-^ f sin 61^) + l{l + 1) sin^ 9Q = m^O , (2.24) 



d9 V d9 

and 



dH 

The equation in (p is easily solved to obtain: 



d^2 ^'-''^ 



$(0) = Ae™"^ . (2.26) 
The wave function must have a single value for each value of ^, and hence we require: 

$(0 + 27r) = $(</)), (2.27) 

which requires that the separation constant m must be an integer. Using this fact in eqn 2.24, we then 
have to solve 

sin 61-^ fsin6»^) +[/(/ + 1) sin^ 6i - m^] = , (2.28) 



d9\ de , 

with the constraint that m must be an integer. On making the substitution u = cos 6 and writing 
e{9) = P{u), eqn 2.28 becomes: 



P = 0. (2.29) 



Equation 2.29 is known as either the Legendre equation or the associated Legendre equation, depending 
on whether m is zero or not. Solutions only exist if I is an integer > |m| and P{u) is a polynomial function 
of u. This means that the solutions to eqn 2.28 arc of the form: 

9(61) = P;" (cos 6*), (2.30) 

where P™ (cos 9) is a, polynomial function in cos 9 called the (associated) Legendre polynomial function. 
Putting this all together, wc then find: 

F(6', (j)) = normalization constant x P;'"(cos 6*) e'""^ , (2.31) 

where m and I are integers, and m can have values from —I to +1. The correctly normalized functions 
are called the spherical harmonic functions Yim{0,(l>)- 

^Be careful not to confuse the magnetic quantum number m with the reduced mass that has the same symbol! Note 
also that a subscript I is often added (i.e. mj) to distinguish it from the quantum number for the z-component of the spin 
{rris)- 
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I m Yim{0,(t)) 













1 









1 


±1 


T- 




2 







S(3cos2 0-1) 


2 


±1 


T- 


sin6»cos6»e±^'^ 


2 


±2 







Table 2.1: Spherical harmonic functions. 

It is apparent from eqns 2.19 and 2.20 that the spherical harmonics satisfy: 

L\m{0, cj>) = l{l + l)h^Yi^{e, cj>) . (2.32) 
Furthermore, on substituting from eqn 2.16, it is also apparent that 

L,Yim{e, 4,) = mhYimie, </.) . (2.33) 

The integers I and m that appear here arc called the orbital and magnetic quantum numbers respectively. 
Some of the spherical harmonic functions are listed in Table 2.1. Equations 2.32-2.33 show that the 
magnitude of the angular momentum and its 2;-component are equal to + l)h and mh respectively, 
as consistent with Fig. 2.1. 

The spherical harmonics have the property that: 

/•TT /'27r 

/ / Yi*^{e,4>)Yi,^,{e,4,) sm6ded<l> = 5i,v5,n,m'. (2.34) 
Je=o J4>=o 

The symbol 5k.k' is called the Kronecker delta function. It has the value of 1 if /c = fc' and if fc 7^ fc'. 
The sinO factor in Eq. 2.34 comes from the volume increment in spherical polar co-ordinates: see Eq. 2.49 
below. 



2.3 Separation of variables in the Schrodinger equation 

The Coulomb potential is an example of a central field. This means that the force only lies along the 
radial direction. This allows us separate the motion into the radial and angular parts. Hence we write: 

*(r, e, (p) = R{r) Y{e, <j)) . (2.35) 

On substituting this into Eq. 2.18, we find 

~-.^{r''f]y^^$^.~Ry=ERy- (2-36) 

2m dr \ ar J 2mr'^ Awcor 

Multiply by r'^/RY and re-arrange to obtain: 

^2, 



I A f 2di?\ ZeV 2 1 ^ ^ 



2m R dr V' dr J A-kcq Y 2m ' ^^"^^^ 



The left hand side is a function of r only, while the right hand side is only a function of the angular 
co-ordinates 9 and cj). The only way this can be true is if both sides are equal to a constant. Let's call 



14 



CHAPTER 2. THE HYDROGEN ATOM 



this constant + l)/2m, where £ is an arbitrary number at this stage. This gives us, after a bit of 

re- arrangement : 

fi2 1 d / 2di?(r)\ h^£{£+l)^. . Ze^ 



r 



R{r) - R{r) = ER{r) , (2.38) 



2m dr \ dr J 2mr^ Ancor 
and 

L Y{e, (/)) = n^£{£ + 1)F(6», 0) . (2.39) 

On comparing Eqs. 2.32 and 2.39 wc can now identify the arbitrary separation constant I with the angular 
momentum quantum number /, and we can see that the function Y(Q, </>) that enters Eq. 2.39 must be 
one of the spherical harmonics. 

We can tidy up the radial equation Eq. 2.38 by writing: 



This gives: 

d^ hH{i + i) ze^ 



P(r) = EP{r) . (2.40) 



2to dr^ 2mr'^ Aireor 
This now makes physical sense. It is a Schrodinger equation of the form: 

HP{r) = EP{r) , (2.41) 

where the energy operator H (i.e. the Hamiltonian) is given by: 

H = -7^:r^+ V;ffective(r-) . (2.42) 
2to dr^ 

The first term in eqn 2.42 is the radial kinetic energy given by 

KF 

l^.J^>.radial - 2^ - dr^ ' 

The second term is the effective potential energy: 

. X hHil + l) Ze^ 
Veffcctive(r) = ^ , ' - , (2.43) 

which has two components. The first of these is the orbital kinetic energy given by: 

_L^ _ hH{l + l) 
lV.Ji..orbital - 2j - 2mr2 ' 

where I = mr^ is the moment of inertia. The second is the usual potential energy due to the Coulomb 
energy. 

This analysis shows that the quantized orbital motion adds quantized kinetic energy to the radial 
motion. For I > the orbital kinetic energy will always be larger than the Coulomb energy at small r, 
and so the effective potential energy will be positive. This has the effect of keeping the electron away 
from the nucleus, and explains why states with / > have nodes at the origin (see below). 



2.4 The wave functions and energies 

The wave function we require is given by Eq. 2.35. We have seen above that the Y{9, cj)) function that 
appears in Eq. 2.35 must be one of the spherical harmonics, some of which are listed in Table 2.1. The 
radial wave function R{r) can be found by solving the radial differential equation given in Eq. 2.38. The 
mathematics is somewhat complicated and is considered in Section 2.5. Here we just quote the main 
results. 

Solutions are only found if we introduce an integer quantum number n. The energy depends only 
on n, but the functional form of R{t) depends on both n and /, and so we must write the radial wave 
function as Rni{r). A list of some of the radial functions is given in Table 2.2. Representative wave 
functions are plotted in Fig. 2.2. 

We can now write the full wave function as: 

^nimir, e, (t>) = Rni{r)Yim{e, (j)) . (2.44) 
The quantum numbers must obey the following rules: 



2.4. THE WAVE FUNCTIONS AND ENERGIES 
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Figure 2.2: The radial wave functions i?„;(r) for the hydrogen atom with Z = 1. Note that the axes for 
the three graphs are not the same. 



n I Rni{r) 



1 (Z/ao)5 2exp(-Zr/ao) 

2 (Z/2ao)t2(l- ^)exp(-Zr/2ao) 
2 1 (Z/2ao)t^(^)exp(-Zr/2ao) 



3 (Z/3ao)5 2 



l-i2Zr/3ao) + l{i^y 



exp(— Zr/3ao) 



(Z/3ao) 5(4^2/3) (^) (l - exp(-Zr/3ao) 
(Z/3ao)^ (2^2/3^) (^) ' exp(-Zr/3ao) 



Table 2.2: Radial wave functions of the hydrogen atom, uq is the Bohr radius (5.29 x 10 ^^m). The 
wave functions are normalized so that J^^^ R^iRnir^dr ~ 1. 



16 



CHAPTER 2. THE HYDROGEN ATOM 



z z z 




Figure 2.3: Polar plots of the spherical harmonics with I < 2. The plots are to be imagined with 
spherical symmetry about the z axis. In these polar plots, the value of the function for a given an- 
gle 6 is plotted as the distance from the origin. Prettier pictures may be found, for example, at: 
http : //mathworld . wolfram . com/SphericalHarmonic . html. 



• n can have any integer value > 1. 

• I can have integer values up to {n — 1). 

• m can have integer values from —I to +1. 

These rules drop out of the mathematical solutions. Functions that do not obey these rules will not 
satisfy the Schrodinger equation for the hydrogen atom. 

The radial wave functions listed in Table 2.2 are of the form: 

Rni{r) = Cni ■ (polynomial in r) • e"-/" , (2.45) 

where a — na^i/Z, an being the Bohr radius of Hydrogen, namely 5.29 x 10~^^ m. Cni is a normalization 
constant. The polynomial functions that drop out of the equations are polynomials of order n — 1, and 
have n — 1 nodes. If / = 0, all the nodes occur at finite r, but if Z > 0, one of the nodes is at r = 0. 
The angular part of the wave function is of the form (see eqn 2.31 and Table 2.1): 

Yi^ie, 0) = • Pr (cos 9) ■ e™^ , (2.46) 

where P™{cos9) is a Legendre polynomial, e.g. Pi{cos9) — constant, P{'(cos6') = cos6', etc. C;^ is 
another normalization constant. Representative polar wave functions are shown in figure 2.3. 
The energy of the system is found to be: 

mZ'^e* 1 

which is the same as the Bohr formula given in Eq. 1.10. Note that this depends only on the principal 
quantum number n: all the I states for a given value of n are degenerate (i.e. have the same energy), 
even though the radial wave functions depend on both n and I. This degeneracy with respect to I is called 
"accidental", and is a consequence of the fact that the electrostatic energy has a precise 1/r dependence 
in hydrogen. In more complex atoms, the electrostatic energy will depart from a pure 1/r dependence 
due to the shielding effect of inner electrons. In this case, the gross energy depends on / as well as n, 
even before we start thinking of higher order fine structure effects. We shall see how this works in more 
detail when we consider the alkali atoms later. 
The wave functions are nomalized so that 

*n,i.m*n',i',m' AV = Sn.n' Sl,l' S^,m' (2.48) 

e=o J0=o 

where d^ is the incremental volume element in spherical polar co-ordinates: 

dV = r^sm9drd9d(j}. (2.49) 
The radial probability function P„i(r) is the probability that the electron is found between r and 7' + dr: 

P„,(r)dr = / / r^sm9drd9d(f> 
Je=o J<p=o 

= \Rniir)\^ dr . (2.50) 
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The factor of that appears here is just related to the surface area of the radial shell of radius r (i.e. 
47rr^.) Some representative radial probability functions are sketched in Fig. 2.4. 3-D plots of the shapes 
of the atomic orbitals are available at: http://www.shef.ac.uk/cheniistry/orbitron/. 
Expectation values of measurable quantities are calculated as follows: 



(A) = JJJ W*A'^dV . (2.51) 
Thus, for example, the expectation value of the radius is given by: 
(r) = jjj ^*rmV 

fOO nTV p27r 



/■OO fTV pZTv 

/ RlirR^ir^dr / l^*„(e, <^)y,„(^, sinMM,^ 

Jr=Q Je=OJcl>=Q 

RlirRnirMr. (2.52) 



This gives: 



Note that this only approaches the Bohr value, namely n^ayi/Z (see eqn 1.15), for the states with I = n—1 
at large n. 



Reading 

Demtroder, W., Atoms, Molecules and Photons, §4.3 - §5.1. 

Haken, H. and Wolf, H.C., The Physics of Atoms and Quanta, chapter 10. 

Phillips, A.C., Introduction to Quantum Mechanics, chapters 8 & 9. 

Beisser, A., Concepts of Modern Physics, chapter 6. 

Eisberg, R. and Resnick, R., Quantum Physics, chapter 7. 
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Figure 2.4: Radial probability functions for the first three n states of the hydrogen atom with Z = I. 
Note that the radial probability is equal to r^|-R„;(r)p, not just to |i?„;(r)p. Note also that the horizontal 
axes are the same for all three graphs, but not the vertical axes. 



2.5. APPENDIX: MATHEMATICAL SOLUTION OF THE RADIAL EQUATION 
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2.5 Appendix: Mathematical solution of the radial equation 

The radial wave equation for hydrogen is given from eqn 2.38 as: 

i r , H f — ^R{r) - R(r) = ERir) , (2.54) 

2m r2 dr V dr J 2mr^ ^ ' 47reor ^ ' ^ ' ^ ' 

where / is an integer > 0. We first put this in a more user-friendly form by introducing the dimensionless 
radius p according to: 

%m\E\^^''^ 



P=\^-^) r. (2.55) 

The modulus sign around E is important here because we are seeking bound solutions where E is negative. 
The radial equation now becomes: 



A^R 2 di? _/;(? + !) A_l 
dp^ p dp \ P 4 



,2 +;,^-^^ + ^-7)^ = o. (2-56) 



where 

We first consider the behaviour at p ^ oo, where eqn 2.56 reduces to: 

d?R 1 
dp2 4 



2--i? = 0. (2.58) 



This has solutions of e^^/^. The e+^'Z^ solution cannot be normalized and is thus excluded, which implies 
that R{p) - e-P/^. 

Now consider the behaviour for p — >■ 0, where the dominant terms in eqn 2.56 are: 



d^R ^2dR _l{l + 1) 
dp^ p dp p 



2 ' ... .2 ^ = 0' (2-59) 



with solutions R{p) = p' or R{p) = p~*^'+^'. The latter diverges at the origin and is thus unacceptable. 

The consideration of the asymptotic behaviours suggests that we should look for general solutions of 
the radial equation with R{p) in the form: 

R{p) = L{p) p' e-''/^ (2.60) 

On substituting into eqn 2.56 we find: 

d^L /21 + 2 _\dL 
dp2 V p y dp p 

We now look for a series solution of the form: 



2+1^^ + : — ^ = 0- (2-61) 



L(p)=^afcp'=. (2.62) 

Substitution into eqn 2.61 yields: 

V \k{k - l)a,p'=-2 + f ^ - l) fcafep'^-i + ^~^~^ aupA = , (2.63) 
which can be re-written: 

oo 

^ [{k{k - 1) + 2k{l + l))afcp'=-^ + (A - Z - 1 - k)akp''-^] = , (2.64) 

fe=0 

or alternatively: 

oo 

^ [{{k + l)k + 2{k + 1){1 + l))afe+ip'=-i + (A - / - 1 - fc)afep'=-i] = . (2.65) 
fc=o 
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This will be satisfied if 

{{k + l)k + 2{k + + l))ak+i + (A - / - 1 - k)ak = , (2.66) 

which implies: 



a/s+i -A + Z + 1 + fc 



At large k we have: 

Now the series expansion of is 



ak (fc + l)(A: + 2Z + 2) 



ak k 



i2m) 

(2.68) 



e^ = l + P+^ + ---^ + --- , (2.69) 

which has the same limit for Uk+i/ak- With R{p) given by eqn 2.60, we would then have a dependence 
of e^'' • e"''/^ = e"'"''/^, which is unacceptable. We therefore conclude that the series expansion must 
terminate for some; value of k. Let rir be the value of k for which the series terminates. It then follows 
that a„^+i = 0, which implies: 

-A + Z + l + nr = 0, nr>0, (2.70) 

or 

X = l + l + n^. (2.71) 
We now introduce the principal quantum number n according to: 

n = n, + l + l. (2.72) 

It follows that: 

1. n is an integer, 

2. n>l + l, 

3. A = n . 

The first two points establish the general rules for the quantum numbers n and I. The third one fixes the 
energy. On inserting A = n into eqn 2.57 and remembering that E is negative, we find: 

"~ (47reo)22?l2 n2 • i^-^"^^ 

This is the usual Bohr result. The wave functions are of the form given in eqn 2.60: 

R{p)=p'L{p)e-^/\ (2.74) 

The polynomial series L{p) that satisfies eqn 2.61 is known as an associated Laguerre function. On 
substituting for p from eqn 2.55 with \E\ given by eqn 2.73, we then obtain: 

R{r) = normalization constant x Laguerre polynomial in r x r''e~^^"' (2-75) 

as before (cf. eqn 2.45), with 



/j2 \i/2 47reo^^ n n 



2m\E\J me^ 
where an is the Bohr radius of hydrogen. 



an , (2.76) 



Chapter 3 

Radiative transitions 



In this chapter we shall look at the classical and quantum theories of radiative emission and absorption. 
This will enable us to derive certain selection rules which determine whether a particular transition is 
allowed or not. Wc shall also investigate the physical mechanisms that affect the shape of the spectral 
lines that are observed in atomic spectra. 

3.1 Classical theories of radiating dipoles 

The classical theories of radiation by atoms were developed at the end of the 19*^ century before the 
discoveries of the electron and the nucleus. With the benefit of hindsight, we can understand more clearly 
how the classical theory works. We model the atom as a heavy nucleus with electrons attached to it by 
springs with different spring constants, as shown in Fig. 3.1(a). The spring represents the binding force 
between the nucleus and the electrons, and the values of the spring constants determine the resonant 
frequencies of each of the electrons in the atom. Every atom therefore has several different natural 
frequencies. 

The nucleus is heavy, and so it does not move very easily. However, the electrons can readily vibrate 
about their mean position, as illustrated in Fig. 3.1(b). The vibrations of the electron create a fluctuating 
electric dipole. In general, electric dipoles consist of two opposite charges of ±q separated by a distance 
d. The dipole moment p is defined by: 



where d is a vector of length d pointing from — g to +q. In the case of atomic dipoles, the positive charged 
is fixed, and so the time dependence of p is just determined by the movement of the electron: 



where x{t) is the time dependence of the electron displacement. 

It is well known that oscillating electric dipoles emit electromagnetic radiation at the oscillation 
frequency. This is how aerials work. Thus we expect an atom that has been excited into vibration to 
emit light waves at one of its natural resonant frequencies. This is the classical explanation of why atoms 
emit characteristic colours when excited electrically in a discharge tube. Furthermore, it is easy to see 
that an incoming light wave of frequency cjo can drive the natural vibrations of the atom through the 
oscillating force exerted on the electron by the electric field of the wave. This transfers energy from 
the light wave to the atom, which causes absorption at the resonant frequency. Hence the atom is also 
expected to absorb strongly at its natural frequency. 

The classical theories actually have to assume that each electron has several natural frequencies of 
varying strengths in order to explain the observed spectra. If you do not do this, you end up predicting, 
for example, that hydrogen only has one emission frequency. There was no classical explanation of the 
origin of the atomic dipoles. It is therefore not surprising that we run into contradictions such as this 
when we try to patch up the model by applying our knowledge of electrons and nuclei gained by hindsight. 

3.2 Quantum theory of radiative transitions 

We have just seen that the classical model can explain why atoms emit and absorb light, but it does not 
offer any explanation for the frequency or the strength of the radiation. These can only be calculated 



p = qd 



(3.1) 



p{t) = —ex{t) 



(3.2) 
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Figure 3.1: (a) Classical atoms consist of electrons bound to a heavy nucleus by springs with characteristic 
force constants, (b) The vibrations of an electron in an atom at its natural resonant frequency ujq creates 
an oscillating electric dipole. This acts like an aerial and emits electromagnetic waves at frequency 
ujQ. Alternatively, an incoming electromagnetic wave at frequency loq can drive the oscillations at their 
resonant frequency. This transfers energy from the wave to the atom, which is equivalent to absorption. 

by using quantum theory. Quantum theory tells us that atoms absorb or emit photons when they jump 
between quantized states, as shown in figure 3.2(a). The absorption or emission processes are called 
radiative transitions. The energy of the photon is equal to the difference in energy of the two levels: 

hiy^ E2~Ei. (3.3) 

Our task here is to calculate the rate at which these transitions occur. 

The transition rate W12 can be calculated from the initial and final wave functions of the states 
involved by using Fermi's golden rule: 

M^i2 = ^|Mi2p5(H, (3.4) 
n 

where M12 is the matrix element for the transition and g{hv) is the density of states. The matrix 
element is equal to the overlap integral^: 

Afi2 = j r2{r)H'{r)^^{r)d?r. (3.5) 

where H' is the perturbation that causes the transition. This represents the interaction between the 
atom and the light wave. There are a number of physical mechanisms that cause atoms to absorb or emit 
light. The strongest process is the electric dipole (El) interaction. We therefore discuss El transitions 
first, leaving the discussion of higher order effects to Section 3.5. 

The density of states factor is defined so that g{hv)(lE is the number of final states per unit volume 
that fall within the energy range E to E + AE, where E = hv. In the standard case of transitions between 
quantized levels in an atom, the initial and final electron states are discrete. In this case, the density of 
states factor that enters the golden rule is the density of photon states.^ In free space, the photons can 
have any frequency and there is a continuum of states available, as illustrated in Fig. 3.2(b). The atom 
can therefore always emit a photon and it is the matrix element that determines the probability for this 
to occur. Hence we concentrate on the matrix element from now on. 



3.3 Electric dipole (El) transitions 

Electric dipole transitions are the quantum mechanical equivalent of the classical dipole oscillator dis- 
cussed in Section 3.1. We assume that the atom is irradiated with light, and makes a jump from level 1 

^This is sometimes written in the shorthand Dirac notation as A/12 = (2|-ff'|l). 

^In solid-state physics, we consider transitions between electron bands rather than between discrete states. We then have 
to consider the density of electron states as well as the density of photon states when we calculate the transition rate. This 
point is covered in other courses, e.g. PHY475: Optical properties of solids. 



3.3. ELECTRIC DIPOLE (El) TRANSITIONS 
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Figure 3.2: (a) Absorption and emission transitions in an atom, (b) Emission into a continuum of photon 
modes during a radiative transition between discrete atomic states. 



to 2 by absorbing a photon. The interaction energy between an electric dipole p and an external electric 
field £ is given by 

E = ~pS. (3.6) 

We presume that the nucleus is heavy, and so we only need to consider the effect on the electron. Hence 
the electric dipole perturbation is given by: 



H' = +er£, 



(3.7) 



where r is the position vector of the electron and £ is the electric field of the light wave. This can be 
simplified to: 

H' ^eix£, + y£y + z£,), (3.8) 

where £x is the component of the field amplitude along the x axis, etc. Now atoms are small compared 
to the wavelength of light, and so the amplitude of the electric field will not vary significantly over the 
dimensions of an atom. We can therefore take £x, £y, and £ z in Eq. 3.8 to be constants in the calculation, 
and just evaluate the following integrals: 



Mi2 oc j ^ix^l)2A^r 
Mi2 oc J ^piyi)2d^r 
Mi2 oc / ipiZTp2<i^r 



a;— polarized light , 
y— polarized light , 
z— polarized light . 



(3.9) 



Integrals of this type are called dipole moments. The dipole moment is thus the key parameter that 
determines the transition rate for the electric dipole process. 

At this stage it is helpful to give a hand-waving explanation for why electric dipole transitions lead to 
the emission of light. To do this we need to to consider the time-dependence of the quantum mechanical 
wave functions. This naturally drops out of the time- dependent Schrodinger equation: 



i?(r)*(r,t) =i;i^*(r,t), 

where H{r) is the Hamiltonian of the system. The solutions of Eq. 3.10 are of the form: 

*(r,i) = V(r)e-'^*/'', 
where ipir) satisfies the time-independent Schrodinger equation: 

H{r)i}}{r) = E^{r). 



(3.10) 



(3.11) 



(3.12) 



During a transition between two quantum states of energies Ei and E2, the electron will be in a super- 
position state with a mixed wave function given by 



*(r,t) = ci^'i(r,t)-Fc2«'2(r,t) 

= ci Vi (r)e^'^i*/'"' + C2^2 (r)e-'^^*/'' , 



(3.13) 



where ci and C2 are the mixing coefficients. The expectation value {x) of the position of the electron is 
given by: 

**a;*dV. (3.14) 
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Quantum number Selection rule 



parity 


changes 


I 


A/ = ±1 


m 


Am = 0,±1 




Am = 




Am = ±l 




Am = +1 




Am = -1 


s 


As = 


rris 


Ams = 



unpolarizcd light 

linear polarization || z 

linear polarization in [x, y) plane 

(T"*" circular polarization 

a~ circular polarization 



Table 3.1: Electric dipole selection rules for the quantum numbers of the states involved in the 
transition. 

With given by Eq. 3.13 we obtain: 

{x) = c\ci j 11)1 X 11)1 r + c*2C2 j tp2Xtp2d^r (3.15) 



This shows that if the dipole moment defined in Eq. 3.9 is non-zero, then the electron wave-packet 
oscillates in space at angular frequency {E2 — Ei)/h. The oscillation of the electron wave packet creates 
an oscillating electric dipole, which then radiates light at angular frequency {E2 — Ei)/h. Hey presto! 



3.4 Selection rules for El transitions 

Electric dipole transitions can only occur if the selection rules simimarizcd in Table 3.1 arc satisfied. 
Transitions that obey these El selection rules are called allowed transitions. If the selection rules are 
not satisfied, the matrix element (i.e. the dipole moment) is zero, and we then see from Eq. 3.4 that the 
transition rate is zero. The origins of these rules are discussed below. 



Parity 

The parity of a function refers to the sign change under inversion about the origin. Thus if /(— x) = f{x) 
we have even parity, whereas if f{—x) = —f{x) we have odd parity. Now atoms are spherically symmetric, 
which implies that 

|V'(-r)p = |V.(+r)p. (3.16) 

Hence we must have that 

V'(-r) = ±i){+r) . (3.17) 

In other words, the wave functions have cither even or odd parity. The dipole moment of the transition 
is given by Eq. 3.9. x, y and z are odd functions, and so the product V'i''/'2 must be an odd function if 
M12 is to be non-zero. Hence V'l and must have different parities. 



The orbital quantum number I 

The parity of the spherical harmonic functions is equal to (—1)'. Hence the parity selection rule implies 
that A^ must be an odd number. Detailed evaluation of the overlap integrals tightens this rule to A^ = ±1. 
This can be seen as a consequence of the fact that the angular momentum of a photon is ±fi., with the 
sign depending on whether we have a left or right circularly polarized photon. Conservation of angular 
momentum therefore requires that the angular momentum of the atom must change by one unit. 



3.5. HIGHER ORDER TRANSITIONS 
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The magnetic quantum number m 

The dipole moment for the transition can be written out expHcitly: 

/•DC /.IT f.2ll 

Mi2« / / / r sin 0drdM0. (3.18) 

Jr=0 J 6=0 J<j>=0 



We consider here just the (p part of this integral: 



because z = r cos 6. Hence we must have that m' 
in the (a;, y) plane, we have integrals like 



(3.19) 
(3.20) 
(3.21) 

= m if Mi2 is to be non-zero. If the light is polarized 



2tt 

M12OC / e-^^'-^re^^dc/), 



where we have made use of the fact that (see Eq. 2.46 or Table 2.1): 

*„,/,™(r,^,</')cxe™^ 
Now for 2:-polarized light we have from Eq. 3.9: 



/'27r pliT 

Mi2(x / e-™''^^e™'^d(/)(x / e"™'"^ • 1 • e™"^ d.?!) 
Jo Jo 



i''2'K pI'K 

Mi2 0^ / e-^'-^xe^-^d^ia / e"™'"^ • e^^'"^ • e™"^ d^i . (3.22) 
Jo Jo 

This is because x = rsmO &xi<p = r sin (?i(e+"^ + c^"^), and similarly for y. This give m' — m = ±1. This 
rule can be tightened up a bit by saying that Am = +1 for cr+ circularly polarized light and Am = — 1 for 
a~ circularly polarized light. If the light is unpolarized, then all three linear polarizations are possible, 
and we can have Am = 0, ±1. 

Spin 

The photon does not interact with the electron spin. Therefore, the spin state of the atom does not 
change during the transition. This implies that the spin quantum numbers s and are unchanged. 



3.5 Higher order transitions 

How does an atom de-excite if El transitions are forbidden by the selection rules? In some eases it 
may be possible for the atom to de-excite by alternative methods. For example, the 3s — >■ Is transition 
is forbidden, but the atom can easily de-excite by two allowed El transitions, namely 3s — >■ 2p, then 
2p — >■ Is. However, this may not always be possible, and in these cases the atom must de-excite by 
making a forbidden transition. The use of the word "forbidden" is somewhat misleading here. It really 
means "electric-dipole forbidden" . The transitions are perfectly possible, but they just occur at a slower 
rate. 

After the electric-dipole interaction, the next two strongest interactions between the photon and 
the atom give rise to magnetic dipole (Ml) and electric quadrupole (E2) transitions. There have 
different selection rules to El transitions (e.g. parity is conserved), and may therefore be allowed when 
when El transitions are forbidden. Ml and E2 transitions are second-order processes and have much 
smaller probabilities than El transitions. 

In extreme cases it may happen that all types of radiative transitions are forbidden. In this case, the 
excited state is said to be metastable, and must de-excite by transferring its energy to other atoms in 
coUisional processes or by multi-photon emission. 



3.6 Radiative lifetimes 

An atom in an excited state has a spontaneous tendency to de-excite by a radiative transition involving 
the emission of a photon. This follows from statistical physics: atoms with excess energy tend to want 
to get rid of it. This process is called spontaneous emission. Let us suppose that there are N2 atoms 
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Transition 


Einstein A coefficient 


Radiative lifetime 


El allowed 


10^ - 10^ s-i 


1 - 10 ns 


El forbidden (Ml or E2) 


10^ - 10*5 s-i 


1 yUS - 1 ms 



Tabic 3.2: Typical transition rates and radiative lifetimes for allowed and forbidden transitions at optical 
frequencies. 

in level 2 at time t. We use quantum mechanics to calculate the transition rate from level 2 to level 1, 
and then write down a rate equation for N2 as follows: 

dNo 

-^ = -AN2. (3.23) 

This merely says that the total number of atoms making transitions is proportional to the number of 
atoms in the excited state and to the quantum mechanical probability. The parameter A that appears in 
eqn 3.23 is called the Einstein A coefficient of the transition. The Einstein B coefficients that describe 
the processes of stimulated emission and absorption are considered in Section 9.3 in the context of laser 
physics. 

Equation 3.23 has the following solution: 

N2{t) = N2{0)cxp{-At) 

= 7V2(0)cxp(-t/T), (3.24) 

r=j- (3.25) 

Equation 3.24 shows that if the atoms are excited into the upper level, the population will decay due to 
spontaneous emission with a time constant r. r is thus called the natural radiative lifetime of the 

excited state. 

The values of the Einstein A coefficient and hence the radiative lifetime t vary considerably from 
transition to transition. Allowed El transitions have A coefficients in the range 10^ — lO^s"^ at optical 
frequencies, giving radiative lifetimes of ^ 1 — 10 ns. Forbidden transitions, on the other hand, arc much 
slower because they are higher order processes. The radiative lifetimes for Ml and E2 transitions are 
typically in the millisecond or microsecond range. This point is summarized in Table 3.2. 

3.7 The width and shape of spectral hnes 

The radiation emitted in atomic transitions is not perfectly monochromatic. The shape of the emission 
line is described by the spectral line shape function 5(1^). This is a function that peaks at the line 
centre defined by 

hPo = {E2-Ei), (3.26) 



where 



and is normalized so that: 



00 



g{i^)du = l. (3.27) 







The most important parameter of the line shape function is the full width at half mEtximum (FWHM) 
Au, which quantifies the width of the spectral line. We shall see below how the different types of line 
broadening mechanisms give rise to two common line shape functions, namely the Lorentzian and 
Gaussian functions. 

In a gas of atoms, spectral lines are broadened by three main processes: 

• natural broadening, 

• collision broadening, 

• Doppler broadening. 



3.8. NATURAL BROADENING 
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Wc shall look at each of these processes separately below. A useful general division can be made at this 
stage by classifying the broadening as either homogeneous or inhomogeneous. 

• Homogeneous broadening affects all the individual atoms in the same way. Natural lifetime and 
collision broadening are examples of homogeneous processes. All the atoms are behaving in the 
same way, and each atom produces the same emission spectrum. 

• Inhomogeneous broadening affects different individual atoms in different ways. Doppler broad- 
ening is the standard example of an inhomogeneous process. The individual atoms are presumed to 
behave identically, but they are moving at different velocities, and one can associate different parts 
of the spectrum with the subset of atoms with the appropriate velocity. Inhomogeneous broadening 
is also found in solids, where different atoms may experience different local environments due to 
the inhomogeneity of the medium. 



3.8 Natural broadening 

We have seen in Section 3.6 that the process of spontaneous emission causes the excited states of an 
atom to have a finite lifetime. Let us suppose that we somehow excite a number of atoms into level 2 
at time t = 0. Equation 3.23 shows us that the rate of transitions is proportional to the instantaneous 
population of the upper level, and eqn 3.24 shows that this population decays exponentially. Thus the 
rate of atomic transitions decays exponentially with time constant r. For every transition from level 2 to 
level 1, a photon of angular frequency ujq = {E2 — Ei)/h is emitted. Therefore a burst of light with an 
exponentially-decaying intensity will be emitted for t > 0: 

I{t) = /(O) exp(-f/r) . (3.28) 

This corresponds to a time dependent electric field of the form: 

i < : £{t) = , 

i>0: £{t) = f e''""* e-*/2^ . (3.29) 

The extra factor of 2 in the exponential in eqn 3.29 compared to eqn 3.28 arises because I{t) oc f (i)^. 
We now take the Fourier transform of the electric field to derive the frequency spectrum of the burst: 

1 /■+°° 

£{uj) = ^ 5(f)e'"*dt. (3.30) 
v27r J-00 

The emission spectrum is then given by: 

^^--)"l^(--)> (.-.o)-V(l/2r)^ - ^'-''^ 
Remembering that oj = 2ny, we find the final result for the spectral line shape function: 

^('-^ = ^ (.-.o)- + (A./2)^ ' (^-^2) 
where the full width at half maximum is given by 

A.= ^. (3.33) 

The spectrum described by eqn 3.32 is called a Lorentzian line shape. This function is plotted in 
Fig. 3.3. Note that we can re-write eqn 3.33 in the following form: 

Au.T=^. (3.34) 

By multiplying both sides by h, we can recast this as: 

AE-T = h/2Tr . (3.35) 



If we realize that r represents the average time the atom stays in the excited state (i.e the uncertainty 
in the time), we can interpret this as the energy— time uncertainty principle. 
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g(v) 




> 



1 



area = 1 



FWHM = 1 / 271X 







-3-2-10123 



>- (v-Vo) 



(v-Vq) in units of l/27i;x 



Figure 3.3: The Lorentzian line shape. The functional form is given in eqn 3.32. The function peaks 
at the line centre vq and has an FWHM of l/27rT. The function is normalized so that the total area is 
unity. 

3.9 Collision (Pressure) broadening 

The atoms in a gas jostle around randomly and frequently collide into each other and the walls of the 
containing vessel. This interrupts the process of light emission and effectively shortens the lifetime of the 
excited state. This gives additional line broadening through the uncertainty principle, as determined by 
eqn 3.33 with r replaced by Tc, where Tc is the mean time between collisions. 

It can be shown from the kinetic theory of gases that the time between collisions in an ideal gas is 
given by: 



where Us is the collision cross-section, and P is the pressure. The collision cross-section is an effective 
area which determines whether two atoms will collide or not. It will be approximately equal to the size 
of the atom. For example, for sodium atoms we have: 



Thus at S.T.P we find Tc ^ 6 x 10^ ^'^ s, which gives a line width of ^ 1 GHz. Note that Tc is much shorter 
than typical radiative lifetimes. For example, the strong yellow D-lines in sodium have a radiative lifetime 
of 16 ns, which is nearly two orders of magnitude larger. 

In conventional atomic discharge tubes, we reduce the effects of pressure broadening by working at 
low pressures. We see from eqn 3.36 that this increases Tc, and hence reduces the linewidth. This is why 
we tend to use "low pressure" discharge lamps for spectroscopy. 

3.10 Doppler broadening 

The spectrum emitted by a typical gas of atoms in a low pressure discharge lamp is usually found to be 
much broader than the radiative lifetime would suggest, even when everything is done to avoid collisions. 
For example, the radiative lifetime for the 632.8 nm line in neon is 2.7 x 10^^ s. Equation 3.33 tells us 
that we should have a spectral width of 0.54 MHz. In fact, the line is about three orders of magnitude 
broader, and moreover, does not have the Lorentzian lineshape given by eqn 3.32. 

The reason for this discrepancy is the thermal motion of the atoms. The atoms in a gas move about 
randomly with a root-mean-square thermal velocity given by: 




(3.36) 



.2 

atom 



TT X (0.2nm)2 = 1.2 x lO^^^m^ . 



(3.37) 



where fee is Boltzmann's constant. At room temperature the thermal velocities are quite large. For 
example, for sodium with a mass number of 23 we find Vx ^ 330 ms^^ at 300 K. This random thermal 
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atom moving at right angles 
to the observer 
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■> v 



Figure 3.4: The Doppler broadening mechanism. The thermal motion of the atoms causes their frequency 
to be shifted by the Doppler effect. 



motion of the atoms gives rise to Doppler shifts in the observed frequencies, which then cause line 
broadening, as illustrated in Fig. 3.4. This is Doppler line broadening mechanism. 

Let us suppose that the atom is emitting light from a transition with centre frequency vq. An atom 
moving with velocity Vx will have its observed frequency shifted by the Doppler effect according to: 

u^iyo{l±'^) , (3.38) 

where the + and — sign apply to motion towards or away from the observer respectively. The number of 
atoms with a particular velocity N(vx) is given by the Maxwell-Boltzmann distribution: 



N{vx) - No 



2kBT 



1/2 



exp 



We can combine eqns 3.38 and 3.39, to find the number of atoms emitting at frequency v. 



N{iy) = No 



f2kBT 



1/2 



exp 



mc^{v — voY 



\ vrm J \ 2A;bTi'q 

The frequency dependence of the light emitted is therefore given by: 



7(.)aexp(^- 2fcBT.o2 
This gives rise to a Gaussian line shape with g{v) given by: 



g{v) oc exp 

with a full width at half maximum equal to: 



2fcBTz.^ 



1/2 



2 ^(21n2)fcBT 



1/2 



(3.39) 



(3.40) 



(3.41) 



(3.42) 



(3.43) 



The Doppler linewidth in a gas at S.T.P is usually several orders of magnitude larger than the natural 
linewidth. For example, the Doppler line width of the 632.8 nm line of neon at 300 K works out to 
be 1.3 GHz, i.e. three orders of magnitude larger than the broadening due to spontaneous emission. 
The dominant broadening mechanism in the emission spectrum of gases at room temperature is usually 
Doppler broadening, and the line shape is closer to Gaussian than Lorentzian. 



3.11 Atoms in solids 

In laser physics we shall frequently be interested in the emission spectra of atoms in crystals. The spectra 
will be subject to lifetime broadening as in gases, since this is a fundamental property of radiative 

^Since Ai^o is proportional to y/T, we can reduce its value by cooling the gas. Cooling also reduces the collision 
broadening because P oc T, and therefore Tc oc T~^^^ . (See eqn 3.36.) Laser cooling techniques can produce temperatures 
in the micro-Kelvin range, where we finally observe the natural line shape of the emission line. 
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emission. However, the atoms are locked in a lattice, and so coUisional broadening is not relevant. 
Doppler broadening does not occur either, for the same reason. On the other hand, the emission lines 
can be broadened by other mechanisms. 

In some cases it may be possible for the atoms to de-excite from the upper level to the lower level 
by making a non-radiative transition. One way this could happen is to drop to the lower level by 
emitting phonons (ie heat) instead of photons. To allow for this possibility, we must re-write eqn 3.23 in 
the following form: 



where tnr is the non-radiative transition time. This shows that non-radiative transitions shorten the 
lifetime of the excited state according to: 



We thus expect additional lifetime broadening according to eqn 3.33. The phonon emission times in solids 
are often very fast, and can cause substantial broadening of the emission lines. This is the solid-state 
equivalent of coUisional broadening. 

Another factor that may cause line broadening is the inhomogencity of the host medium, for example 
when the atoms are doped into a glass. If the environment in which the atoms find themselves is not 
entirely uniform, the emission spectrum will be affected through the interaction between the atom and 
the local environment. This is an example of an inhomogeneous broadening mechanism. 
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Chapter 4 

The shell model and alkali spectra 



Everything wo have boon doing so far in this course apphes to hydrogcnic atoms. We have taken this 
approach because the hydrogen atom only contains two particles: the nucleus and the electron. This is 
a two-body system and can be solved exactly by separating the motion into the centre of mass and 
relative co-ordinates. This has allowed us to find the wave functions and understand the meaning of the 
quantum numbers n, I, mi and rrig. 

We are well aware, however, that hydrogen is only the first of about 100 elements. These are not two 
body problems: we have one nucleus and many electrons, which is a many body problem, with no 
exact solution. This chapter begins our consideration of the approximation techniques that are used to 
understand the behaviour of many-electron atoms. 



4.1 The central field approximation 

The Hamiltonian for an A/^-electron atom with nuclear charge +Ze can be written in the form: 



N 



i=l 



2m 



N 



(4.1) 



where N = Z fov a neutral atom. The subscripts i and j refer to individual electrons and rij = \ri — rj\. 
The first summation accounts for the kinetic energy of the electrons and their Coulomb interaction with 
the nucleus, while the second accounts for the electron-electron repulsion. 

It is not possible to find an exact solution to the Schrodinger equation with a Hamiltonian of the 
form given by eqn 4.1, because the electron-electron repulsion term depends on the co-ordinates of two 
of the electrons, and so we cannot separate the wave function into a product of single-particle states. 
Furthermore, the electron-electron repulsion term is comparable in magnitude to the first summation, 
making it impossible to use perturbation theory either. The description of multi-electron atoms therefore 
usually starts with the central field approximation in which we re- write the Hamiltonian of eqn 4.1 
in the form:^ 

H = Y^{ " 2m^* Ke„tral(ri) j + Kesidual , (4.2) 

where Vccntrai is the central field and K-csiduai is the residual electrostatic interaction. 
The central field approximation works in the limit where 



N 

E 



Central (^i) 



> |K-esidual| 



(4.3) 



In this case, we can treat Kcsiduai as a perturbation, and worry about it later. We then have to solve a 
Schrodinger equation in the form: 



■ N 

E 



2m 



Vi -l-ycentral(r-i) 



(4.4) 



field is described as "central" if the potential energy has spherical symmetry about the origin, so that V{r) only 
depends on r. The fact that V does not depend on 6 or </> means that the force is parallel to r, i.e. it points centrally 
towards or away from the origin. 



31 



32 



CHAPTER 4. THE SHELL MODEL AND ALKALI SPECTRA 



This is not as bad as it looks. By writing^ 

* = V'i(''i)V'2(»'2)---V'Jv(rjv), (4.5) 
we end up with N separate Schrodingcr equations of the form: 

+ Kentrai(r-i) J Mri) = Ei Vi(ri) , (4.6) 
with 

E = Ei+E2---En . (4.7) 

This is much more tractable. We might need a computer to solve any one of the single particle wave 
Schrodinger equations of the type given in eqn 4.6, but at least it is possible in principle. Furthermore, the 
fact that the potentials that appear in eqn 4.6 only depend on the radial co-ordinate Tj (i.e. no dependence 
on the angles di and (t>i) means that every electron is in a well-defined orbital angular momentum state, ^ 
and that the separation of variables discussed in Section 2.3 is valid. In analogy with eqn 2.35, we can 
then write: 

tPiin) = V(ri, Oi, c/^i) = Riin) Yiieu cPi) . (4.8) 
By proceeding exactly as in Section 2.3, we end up with two equations, namely: 

LlYi^mM,<l>i) = fi'hik + m^mM^^i) > (4-9) 



and 



^ ^rf^) + + y_,,,i(r-,) ) R,{r,) = E, R,{n) . (4.10) 



2m rf dr, \ ' dr, / 2mr, 



The first tells us that the angular part of the wave functions will be given by the spherical harmonic 
functions described in Section 2.2, while the second one allows us to work out the energy and radial wave 
function for a given form of Central (''i) and value of Zj. Each electron will therefore have four quantum 
numbers: 

• I and m;: these drop out of the angular equation for each electron, namely eqn 4.9. 

• n: this arises from solving eqn 4.10 with the appropriate form of Vccntrai(''') for a given value of I. 
n and I together determine the radial wave function i?„j(r) (which cannot be expected to be the 
same as the hydrogenic ones given in Table 2.2) and the energy of the electron. 

• rris: spin has not entered the argument. Each electron can therefore either have spin up (mg = +1/2) 
or down (m^ = —1/2), as usual. We do not need to specify the spin quantum number s because it 

is always equal to 1/2. 

The state of the many-electron atom is then found by working out the wave functions of the individual 
electrons and finding the total energy of the atom according to eqn 4.7. This provides a useful working 
model that will be explored in detail below. 

In the following sections we shall consider the experimental evidence for the shell model which proves 
that the central approximation is a good one. The reason it works is based on the nature of the shells. 
An individual electron experiences an electrostatic potential due to the Coulomb repulsion from all the 
other electrons in the atom. Nearly all of the electrons in a many-electron atom are in closed sub-shells, 
which have spherically-symmetric charge clouds. The off-radial forces from electrons in these closed shells 
cancel because of the spherical symmetry. Furthermore, the off-radial forces from electrons in unfilled 
shells are usually relatively small compared to the radial ones. We therefore expect the approximation 
given in eqn 4.3 to be valid for most atoms. 



4.2 The shell model and the periodic table 

We summarize here what we know so far about atomic states. 

^Thc fact that electrons are indistinguishable particles means that we cannot distinguish physically between the case 
with electron 1 in state 1, electron 2 in state 2, . . . , and the case with electron 2 in state 1, electron 1 in state 2, . . . , etc. 
We should therefore really write down a linear combination of all such possibilities. We shall reconsider this point when 
considering the helium atom in Chapter 8. 

^As noted in Section 2.2, the torque on the electron is zero if the force points centrally towards the nucleus. This means 
that the orbital angular momentum is constant. 
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Quantum number 


symbol 


Value 


principal 


n 


any integer > 


orbital 


I 


integer up to (n — 1) 


magnetic 


m 


integer from — / to +/ 


spin 




±1/2 



Table 4.1: Quantum numbers for electrons in atoms. 



1. The electronic states arc specified by four quantum numbers: n, I, mi and m^. The values that 
these quantum numbers can take are summarized in Table 4.1. In spectroscopic notation, electrons 
with Z = 0, 1, 2, 3, . . . are called s, p, d, f, ... electrons. 

2. The gross energy of the electron is determined by n and I, except in hydrogenic atoms, where the 
gross structure depends only on n. 

3. In the absence of fine structure and external magnetic fields, all the states with the same values of n 
and I are degenerate. Each (n, /) term of the gross structure therefore contains 2{2l + 1) degenerate 
levels. 

In the shell model, we forget about fine structure and external magnetic fields, and just concentrate on 
the gross structure. This approximation is jiistified by the fact that the fine structure and magnetic field 
splittings are smaller than the gross structure energies by a factor of about Z'^a'^ = (Z^/137)^ ~ 10"** Z"^. 
If necessary, we will think about fine structure and field splittings after we have dealt with the gross 
structure of the multi-electron atoms, but note that the fine structure energy can get to be quite significant 
for large Z. 

The Pauli exclusion principle tells us that we can only put one electron in each quantum state. 

This is a consequence of the fact that the electrons are indistinguishable fermions (ie spin 1/2 particles). 
We shall discuss some of the other consequences of the particle indistinguishability in Chapter 8. We 
concentrate here on how it leads to the notion of atomic shells. 

The shell model assumes that we can order the energies of the gross structure terms of a multi- 
electron atom according to the quantum numbers n and I. We further assume that the energies of the 
terms increase in big jumps each time we move to the next set of quantum numbers. As we add electrons 
to the atom, the electrons therefore fill up the lowest available shell until it is full, and then go on to the 
next one. The filling up of the shells in order of increasing energy in multi-electron atoms is sometimes 
called the Aufbau principle,^ and is the basis of the periodic table of elements. The shells are listed 
in order of increasing energy in Table 4.2. 

Inspection of Table 4.2 shows us that the energy of the shells always increases with n and I. We build 
up multi-electron atoms by adding electrons one by one, putting each electron into the lowest energy 
shell that has unfilled states. In general, this will be the one with the lowest n, but there are exceptions 
to this rule. For example, the 19**^ electron goes into 4s shell rather than the 3d shell. Similarly, the 37*^ 
electron goes into 5s shell rather than the 4d shell. This happens because the energy of the shell with a 
large I value may be higher than that of another shell with a larger value of n hut smaller value of I. 

The periodic table of elements is built up by adding electrons into the shells as the atomic number 
increases. This allows us to determine the electronic configuration of the elements, that is, the 
quantum numbers of the electrons in the atom. The configurations of the first 11 elements are listed in 
Table 4.3. The superscript attached to the shell tells us how many electrons are in that shell. The process 
of filling the shells follows the pattern indicated in Table 4.2. The nl sub-shells are filled diagonally when 
laid out in rows determined by the principal quantum number n, as shown in Fig. 4.1.^ 

4.3 Justification of the shell model 

The theoretical justification for the shell model relies on the concept of screening. The idea is that the 
electrons in the inner shells screen the outer electrons from the potential of the nucleus. To see how this 
works we take sodium as an example. 

*The German word Aufbau means "building up" . 

^There are some exceptions to the general rules. For example, copper (Cu) with Z = 29 has a configuration of • • • 4s'^ Sd^'' 
instead of • • • 4s^ 3d*. This happens because filled shells are particularly stable. It is therefore energetically advantageous 
to promote the second 4s electron into the 3d shell to give the very stable 3d^'' configuration. The energy difference between 
the two configurations is not particularly large, which explains why copper sometimes behaves as though it is monovalent, 
and sometimes divalent. 



34 



CHAPTER 4. THE SHELL MODEL AND ALKALI SPECTRA 



Shell 


n 


/ 




nil 








Nshcll 


l^accum 


Is 


1 














±1/2 


2 


2 


2s 


2 














±1/2 


2 


4 


2p 


2 


1 




-1,0,- 


fl 




±1/2 


6 


10 


3s 


3 














±1/2 


2 


12 


3p 


3 


1 




-1,0,- 


t-1 




±1/2 


6 


18 


4s 


4 














±1/2 


2 


20 


3d 


3 


2 


-2, 


-1,0,- 


M,- 


f2 


±1/2 


10 


30 


4p 


4 


1 




-1,0,- 


M 




±1/2 


6 


36 


5s 


5 














±1/2 


2 


38 


4d 


4 


2 


-2, 


-1,0,- 


M,- 


f2 


±1/2 


10 


48 


5p 


5 


1 




-1,0,- 


M 




±1/2 


6 


54 


6s 


6 














±1/2 


2 


56 


4f 


4 


3 


-3,-2, 


-1,0,- 


M,- 




f3 ±1/2 


14 


70 


5d 


5 


2 


-2, 


-1,0,- 


M,- 


f2 


±1/2 


10 


80 


6p 


6 


1 




-1,0,- 


M 




±1/2 


6 


86 


7s 


7 














±1/2 


2 


88 



Table 4.2: Atomic shells, listed in order of increasing energy. NghoU is equal to 2(2^ + 1) and is the number 
of electrons that can fit into the shell due to the degeneracy of the m; and levels. The last column 
gives the accumulated number of electrons that can be held by the atom once the particular shell and all 
the lower ones have been filled. 



Element 


Atomic number 


Electronic configuration 


H 


1 


Isi 


He 


2 


1S2 


Li 


3 


ls2 2si 


Be 


4 


ls2 2s2 


B 


5 


ls2 2s2 2pi 


C 


6 


ls2 2s2 2p2 


N 


7 


ls2 2s2 2p3 





8 


ls2 2s2 2p4 


F 


9 


ls2 2s2 2p5 


Ne 


10 


Is^ 2s2 2p6 


Na 


11 


ls2 2s2 2p6 3si 



Table 4.3: The electronic configuration of the first 11 elements of the periodic table. 



1s 

2s ,2p' 

^ ^ ^ ^ 

AP M 

^s' .5p' Sf' 5g 

>6s ^C"? ,6cl 6f 6g 6h 

Figure 4.1: Atomic shells are filled in diagonal order when listed in rows according to the principal 
quantum number n. 
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Figure 4.2: The electronic configuration of tlie sodium atom according to the shell model. 
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Table 4.4: Radii and energies of the principal atomic shells of sodium according to the Bohr model. The 
unit of 1 Angstrom (A) = 10~^" m. 



Sodium has an atomic number of 11, and therefore has a nucleus with a charge of +lle with 11 
electrons orbiting around it. The picture of the atom based on the shell model is shown in Fig. 4.2. The 
radii and energies of the electrons in their shells are estimated using the Bohr formulae: 

Tn = ^"H' (4-11) 

En = -(^ i?H, (4.12) 

where an — 5.29 x 10^^^ m is the Bohr radius of hydrogen, i?H = 13.6 eV is the Rydberg constant and 
Z is the atomic number. 

The first two electrons go into the n = 1 shell. These electrons see the full nuclear charge of +lle. 
With n = 1 and Z = 11, we find ri = 1^/11 x an = 0.05 A and Ei = -ll^i^n = -1650 eV. The next 
eight electrons go into the n — 2 shell. These are presumed to orbit outside the n — 1 shell. The two inner 
electrons partly screen the nuclear charge, and the n = 2 electrons see an effective charge Zog = +9e. 
The radius is therefore = (2V9) x an = 0.24 A and the energy is E2 = -(9/2)2i?H = -275 eV. Finally, 
the outermost electron in the n = 3 shell orbits outside the filled n = 1 and n = 2 shells, and therefore 
sees Zoff = 1. With Z = 1 and n = 3 we find = 4.8 A and E^ = —1.5 eV. These values are summarized 
in Table 4.4. Note the large jump in energy and radius in moving from one shell to the next. 

The treatment of the screening discussed in the previous paragraph is clearly over-simplified because 
it is based on Bohr-type orbits and does not treat the electron-electron repulsion properly. In Section 4.5 
we shall see how we might improve on it. One point to realize, however, is that the model is reasonably 
self-consistent: by assuming that the inner shells screen the outer ones, we find that the orbital radius 
increases in each subsequent shell, which corroborates our original assumption. This is why the model 
works so well. 



4.4 Experimental evidence for the shell model 

There is a wealth of experimental evidence to confirm that the shell model is a good one. The main 
points are discussed briefiy here. 

Ionization potentials and atomic radii 

The ionization potentials of the noble gas elements are the highest within a particular period of the 
atomic table, while those of the alkali metals are the lowest. This can be seen by looking at the data in 
Fig. 4.3. The ionization potential gradually increases as the atomic number increases until the shell is 
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Figure 4.3: First ionization potentials of the elements up to calcium. The noble gas elements (He, Ne, 
Ar) have highly stable fully filled shells with large ionization potentials. The alkali metals (Li, Na, K) 
have one weakly-bound valence electron outside fully-filled shells. 




K O 



Figure 4.4: (a) A typical X-ray tube. Electrons are accelerated with a voltage of several kV and impact on 
a target, causing it to emit X-rays, (b) Transitions occurring in the K-series emission lines. An electron 
from the discharge tube ejects one of the K-shell electrons of the target, leaving an empty level in the 
K shell. X-ray photons are emitted as electrons from the higher shells drop down to fill the hole in the 
K-shell. 



filled, and then it drops abruptly. This shows that the filled shells are very stable, and that the valence 
electrons go in larger, less tightly-bound orbits. The results correlate with the chemical activity of the 
elements. The noble gases require large amounts of energy to liberate their outermost electrons, and 
they are therefore chemically inert. The alkali metals, on the other hand, need much less energy, and are 
therefore highly reactive. 

It is also found that the average atomic radius determined by X-ray crystallography on closely packed 
crystals is largest for the alkali metals. This is further evidence that we have weakly-bound valence 
electrons outside strongly-bound, small-radius, inner shells. 

X-ray line spectra 

Measurements of X-ray line spectra allow the energies of the inner shells to be determined directly. The 
experimental arrangement for observing an X-ray emission spectrum is shown in Fig. 4.4(a). Electrons 
are accelerated across a potential drop of several kV and then impact on a target. This ejects core 
electrons from the inner shells of the target, as shown in Fig. 4.4(b). X-ray photons are emitted as the 
higher energy electrons drop down to fill the empty level (or hole) in the lower shell. 

Each target emits a series of characteristic lines. The series generated when a K-shell (n = 1) electron 
has been ejected is called the K-series. Similarly, the L- and M-series correspond to ejection of L-shell 
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Wavelength (A) Photon Energy (keV) 

Figure 4.5: (a) X-ray emission spectra for tungsten at two different electron voltages. The sharp lines 
are caused by radiative transitions after the electron beam ejects an inner shell electron, as indicated in 
Fig. 4.4(b). The continuum is caused by bremsstrahlung, which has a short wavelength limit equal to 
hc/eV at voltage V. (b) X-ray absorption cross-section spectrum for lead. 



(n = 2) or M-shell (n = 3) core electrons respectively. This old spectroscopic notation dates back to the 
early work on X-ray spectra. 

Figure 4.5(a) shows a typical X-ray emission spectrum. The spectrum consists of a series of sharp 
lines on top of a continuous spectrum. The groups of sharp lines are generated by radiative transitions 
following the ejection of an inner shell electron as indicated in Fig. 4.4(b). As the voltage is increased, 
new groups of lines appear, as the higher energy electron beam ejects ever deeper inner shell electrons. 
Several groups of lines are observed as the hole in the initial shell moves up through the higher shells. 
For example, L-shell lines follow K-shell lines after the electron in the L-shell drops to the hole in the 
K-shell, thus leaving a hole in the L-shell, and so on. 

The continuous spectrum is caused by bremsstrahlung.^ Bremsstrahlung occurs when the electron 
is scattered by the atoms without ejecting a core electron from the target. The acceleration of the electron 
associated with its change of direction causes it to radiate. Conservation of energy demands that the 
frequency of the radiation must cut off when hi' — eV, V being the voltage across the tube. This means 
that the minimum wavelength is equal to hc/eV . The reduction of the short wavelength limit of the 
bremsstrahlung with increasing voltage is apparent in the data shown in Fig. 4.5(a). 

The energy of an electron in an inner shell with principal quantum number n is given by: 



En 



Rh , 



(4.13) 



where is the effective nuclear charge, and i?H = 13.6 eV. The difference between Z (the atomic 
number of the target) and is caused by the screening effect of the other electrons. The energy of the 
optical transition from n — >■ n' is thus given by: 



hv = \E„> - E.„ 



7cff^ 



V2 



(4.14) 



In practice, the wavelengths of the various series of emission lines are found to obey Moseley's law, 
where we make the approximation Z^ = and write both as (Z — cr„). For example, the K-shell lines 
are given by:^ 

'^«(Z-a^)^i?H(l-^) , (4.15) 
where n > 1 and ax ^ 3. Similarly, the L-shell spectra obey: 



(4.16) 



^German: brems = braking (i.e. deceleration) -|- strahlung = radiation. 

^There is no real scientific justification for the approximation = in Moseley's law. The law is an empirical one 
and reflects the fact that the transition wavelength is mainly dominated by the energy of the lower shell. Note also that 
close inspection of the line spectra reveals sub-structure due to the relatively small energy differences between the I states 
for a particular value of n, and further smaller splittings due to spin-orbit coupling. 
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Element 


Z 


Electronic configuration 


Lithium 


3 


ls22si 


Sodium 


11 


[Ne] 3si 


Potassium 


19 


[Ar] 4si 


Rubidium 


37 


[Kr] 5si 


Cesium 


55 


[Xe] 6si 



Table 4.5: Alkali metals. The symbol [. . . ] indicates that the inner shells are filled according to the 
electronic configuration of the noble gas element identified in the bracket. 



where n > 2, and ^ 10. Wc can sec that these arc just the expected wavelengths predicted by the Bohr 
model, except that we have an effective charge of (Z — cr„) instead of Z. The phenomenological screening 
parameter fT„ that appears here accounts for the screening of the nucleus by the other electrons and 
varies from shell to shell. 

X-ray absorption spectra show a complementary frequency dependence to the emission spectra. Fig- 
ure 4.5(b) shows a typical X-ray absorption spectrum. A sharp increase in the absorption cross section^ 
occurs whenever the photon energy crosses the threshold to eject an electron out of an inner shell to empty 
states above the highest occupied shell. This sharp increase in the absorption is called the absorption 
edge. The final state for the electron after the absorption transition could either be one of the excited 
states of the valence electrons or in the continuum above the ionization limit. The binding energy of the 
valence electrons is negligible on the scale of X-ray energies, and so we can effectively put En' = in 
eqn 4.14 and hence obtain the energy of the absorption edge as: 

The absorption probability decreases as the electron gets promoted further into the continuum. Hence 
wc sec a peak in the absorption at /iz/^'^sc g^j^j decrease thereafter. The K and L shell absorption edges 
are clearly visible at 88 keV and 15 keV respectively in Fig. 4.5(b) . 



4.5 Effective potentials, screening, and alkali metals 

The electrons in a multi-electron atom arrange themselves with the smallest number of electrons in 
unfilled shells outside inner filled shells. These outermost electrons are called the valence electrons of 
the atom. They are responsible for the chemical activity of the particular elements. 

In order to work out the energy levels of the valence electrons, we need to solve the iV-electron 
Schrodinger equation given in eqn 4.1. Within the central- field approximation, each valence electron 
satisfies a Schrodinger equation of the type given in eqn 4.6, which can be written in the form: 

The Coulomb repulsion from the core electrons is lumped into the effective potential VJ,'jf (r) . This is only 
an approximation to the real behaviour, but it can be reasonably good, depending on how well we work 
out V^jj(r). Note that the effective potential depends on I. This arises from the term in I that appears in 
eqn 4.10 and has important consequences, as we shall see below. 

The overall dependence of Kff (r) with r must look something like Fig. 4.6. At very large values of r, 
the outermost valence electron will be well outside any filled shells, and will thus only see an attractive 
potential equivalent to a charge of -|-e. On the other hand, if r is very small, the electron will see 
the full nuclear charge of +Ze. The potential at intermediate values of r must lie somewhere between 
these two limits: hence the generic form of 14ff(r) shown in Fig. 4.6. The task of calculating Vgg(r) 
keeps theoretical atomic physicists busy. Two common approximation techniques used to perform the 
calculations are called the Hartree and Thomas-Fermi methods. 

As a specific example, we consider the alkali metals such as lithium, sodium and potassium, which 
come from group I of the periodic table. They have one valence electron outside filled inner shells, as 

* Absorption coefficients are often expressed as "cross sections". The cross section is equal to the effective area of the 
beam that is blocked out by the absorption of an individual atom. If there are N atoms per unit volume, and the cross 
section is equal to (Tabsi the absorption coefficient in m~^ is equal to AT (Tabs- 
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Figure 4.6: (a) Typical effective potential Vcs{r) for the valence electrons of an atom with atomic number 
Z. (b) Radial probability densities for hydrogenic 3s and 3p wave functions, oq is the Bohr radius 
(0.529 A). The shaded region near r = represents the inner core shells for the case of sodium with 
Z = 11. 
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Table 4.6: Values of the quantum defect S{1) for sodium against n and /. 



indicated in Table 4.5. They are therefore approximately one-electron systems, and can be understood 
by introducing a phenomenological number called the quantum defect to describe the energies. To see 
how this works, we consider the sodium atom. 

The shell model picture of sodium is shown in Fig. 4.2. The optical spectra are determined by 
excitations of the outermost 3s electron. The energy of each (n, I) term of the valence electron is given 
by: 



where n > 3 and S{1) is the quantum defect. The quantum defect allows for the penetration of the inner 
shells by the valence electron. 

The dependence of the quantum defect on I can be understood with reference to Fig. 4.6(b). This 
shows the radial probability densities Pni{r) = r'^\R{r)\'^ for the 3s and 3p orbitals of a hydrogenic atom 
with Z = 1, which might be expected to be a reasonable approximation for the single valence electron of 
sodium. The shaded region near r — represents the inner n = 1 and n = 2 shells with radii of ^ 0.09ao 
and ~ 0.44ao respectively. (See Section 4.3.) We see that both the 3s and 3p orbitals penetrate the inner 
shells, and that this penetration is much greater for the 3s electron. The electron will therefore see a 
larger effective nuclear charge for part of its orbit, and this will have the effect of reducing the energies. 
The energy reduction is largest for the 3s electron due to its larger core penetration. 

The quantum defect d{l) was introduced empirically to account for the optical spectra. In principle 
it should depend on both n and I, but it was found experimentally to depend mainly on I. This can be 
seen from the values of the quantum defect for sodium tabulated in Table 4.6. The corresponding energy 
spectrum is shown schematically in Fig. 4.7. Note that 6{l) is very small for I > 2. 

We can use the quantum defect to calculate the wavelengths of the emission lines. The D lines 
correspond to the 3p — > 3s transition. By using the values of S given in Table 4.6, we find that the 
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Figure 4.7: Schematic energy level diagram for sodium, showing the ordering of the energy levels. 



wavelength is given by: 

he 



Rh 



A V[3-'5(3s)]2 [3-5(3p)]^ 

= (1.10 X lO^cm-i) X I — ^1 

^ ' V 1-6272 2.11727 



This gives the wave number v = 1.70 x 10^ cm ^, and so A is equal to 590nm, as we would expect for the 
yellow D-lines of sodium. 
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Chapter 5 

Angular momentum 



Wc noted in Section 2.2 that the treatment of angular momentum is very important for understanding 
the properties of atoms. It is now time to explore these effects in detail, and to see how this leads to the 
classification of the quantized states of atoms by their angular momentum. 

5.1 Conservation of angular momentum 

In the Sections that follow, we are going to consider several different types of angular momentum, and 
the ways in which they are coupled together. Before going into the details, it is useful to stress one 
very important point related to conservation of angular momentum. In an isolated atom, there are 
many forces (and hence torques) acting inside the atom. These internal forces cannot change the total 
angular momentum of the atom, since conservation of angular momentum demands that the angular 
momentum of the atom as a whole must be conserved in the absence of any external torques. The total 
angular momentum of the atom is normally determined by its electrons. The total electronic angular 
momentum is written J, and is specified by the quantum number J. The principle of conservation of 
angular momentum therefore requires that isolated atoms always have well-defined J states.^ It is this 
J value that determines, for example, the magnetic dipole moment of the atom. 

The principle of conservation of angular momentum does not apply, of coiirsc, when external pertur- 
bations are applied. The most obvious example is the perturbation caused by the emission or absorption 
of a photon. In this case the angular momentum of the atom must change because the photon itself 
carries angiilar momentum, and the angular momentum of the whole system (atom + photon) has to be 
conserved. The change in J is then governed by selection rules, as discussed, for example, in Section 5.8. 
Another obvious example is the effect of a strong external DC magnetic field. In this case it is possible 
for the magnetic field to produce states where the component of angular momentum along the direction 
of the field is well-defined, but not the total angular momentum. (See the discussion of the Paschen-Back 
effect in Section 7.1.3.) 

5.2 Types of angular momentum 

The electrons in atoms possess two different types of angular momentum, namely orbital and spin angular 
momentum. These are discussed separately below. 

5.2.1 Orbital angulcir momentum 

The orbital angular momentum of an electron was considered in Section 2.2. The electron orbits around 
the nucleus, and this gives it angular momentum I. The angular momentum is specified by two quantum 
numbers: I and m. The latter is sometimes given an extra subscript (i.e. mi) to distinguish it from the 
spin quantum number rUs considered below. 
The magnitude of I is given by 

\i\ = ^/i(T+i)h, (5.1) 

^This statement about J has to be qualified somewhat when we add in the effects of the nucleus. The angular momentum 
of an atom is the resultant of the electronic angular momentum and the nuclear spin. The total angular momentum of an 
isolated atom has to be conserved, but the electrons can exchange angular momentum with the nucleus through hyperfine 
interactions. (See Section 6.7.2.) These intera<;tions are very weak, and can usually be neglected except when explicitly 
considering nuclear effects. 
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and the component along the z axis by 

h=mh. (5.2) 

I can take positive integer values (including 0) and m can take values in integer steps from —I to +1. 
The number of m states for each I state is therefore equal to {21 + 1). These m states are degenerate in 
isolated atoms, but can be split by external perturbations (e.g. magnetic or electric fields.) 

In classical orbit problems, angular momentum is conserved when the force F is radial: i.e. F = Fr, 
where f is a unit vector parallel to r. This follows from the equation of motion: 

^=T = rxF = rxFr = 0, (5.3) 

where T is the torque. In the hydrogen atom, the Coulomb force on the electron acts towards the nucleus, 

and hence I is conserved. This is why the angular momentum ends up being quantized with well-defined 
constant values when we consider the quantum mechanics of the hydrogen atom. It is also the case 
that the individual electrons of many-electron atoms have well-defined I states. This follows because the 
central field approximation gives a very good description of the behaviour of many electron atoms (see 
Section 4.1), and the dominant resultant force on the electron is radial (i.e. central) in this limit. ^ 



5.2.2 Spin angular momentum 

A wealth of data derived from the optical, magnetic and chemical properties of atoms points to the fact 
that electrons possess an additional type of angular momentum called spin. The electron behaves as if 
it spins around its own internal axis, but this analogy should not be taken literally — the electron is, as 
far as we know, a point particle, and so cannot be spinning in any classical way. In fact, spin is a purely 
quantum effect with no classical explanation. 

The discovery of spin goes back to the Stern-Gerlach experiment, in which a beam of atoms is deflected 
by a non- uniform magnetic field. (See Fig. 1.4). The force on a magnetic dipole in a non-uniform magnetic 
field is given by:^ 

f'z = M.^, (5.4) 

where dB/dz is the field gradient, which is assumed to point along the z direction, and Hz is the z- 
component of the magnetic dipole of the atom. In Lecture Notes 6 we shall explore the origin of magnetic 
dipoles in detail. At this stage, all we need to know is that the magnetic dipole is directly proportional 
to the angular momentum of the atom. (See Section 6.1.) 

The original Stern-Gerlach experiment was performed on silver atoms, which have a ground-state 
electronic configuration of [Kr] 4d^'^ 5s^. Filled shells have no net angular momentum, because there 
are as many positive mi states occupied as negative ones. Furthermore, electrons in s-shcUs have I = 
and therefore the orbital angular momentum of the atom is zero. This implies that the orbital magnetic 
dipole of the atom is also zero, and hence we expect no deflection. However, the experiment showed that 
the atoms were deflected either up or down, as indicated in Fig. 1.4. 

In order to explain the up/down deflection of the atoms with no orbital angular momentum, we have 
to assume that each electron possesses an additional type of magnetic dipole moment. This magnetic 
dipole is attributed to the spin angular momentum. In analogy with orbital angular momentum, spin 
angular momentum is described by two quantum numbers s and m^, where rrig can take the (2s + 1) 
values in integer steps from —s to +s. The magnitude of the spin angular momentum is given by 

|s| = ^s{s + l)h, (5.5) 

and the component along the z axis is given by 

Sz = rUsh . (5.6) 

^The inclusion of non-central forces via the residual electrostatic interaction leads to some mixing of the orbital angular 
momentum states. This can explain why transitions that are apparently forbidden by selection rules can sometimes be 
observed, albeit with low transition probabilities. 

^Note that we need a non-uniform magnetic field to deflect a magnetic dipole. A uniform magnetic field merely exerts 
a torque, not a force. We can understand this by analogy with electrostatics. Electric monopoles (i.e. free charges) can 
be moved by applying electric fields, but an electric dipole experiences no net force in a uniform electric field because the 
forces on the positive and negative charges cancel. If we wish to apply a force to an electric dipole, we therefore need to 
apply a non-uniform electric field, so that the forces on the two charges are different. Magnetic monopoles do not exist 
(as far as we know), and so all atomic magnets are dipoles. Hence we must apply a non-uniform magnetic field to exert a 
magnetic force on an atom. 
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Figure 5.1: (a) Vector addition of two angular momentum vectors A and B to form the resultant C. (b) 
Vector model of the atom. The spin-orbit interaction couples I and s together to form the resultant j. 
The magnitudes of the vectors are given by: \j\ — \J 3[j + 1)^, |^| = ■\/^(? + 1)^, and \s\ = \J si^s + . 

The fact that atoms with a single s-shell valence electron (e.g. silver) are only deflected in two directions 
(i.e. up or down) implies that the spin quantum numbers of the electron must have the following values: 

s = 1/2, 
ms = ±1/2. 

The Stern-Gerlach experiment is just one of many pieces of evidence that support the hypothesis for 
electron spin. Here is an incomplete list of other evidence for spin based on atomic physics: 

• The periodic table of elements, which is the foundation of the whole subject of chemistry, cannot 
be explained unless we assume that the electrons possess spin. 

• High resolution spectroscopy of atomic spectral lines shows that they frequently consist of closely- 
spaced multiplets. This fine structure is caused by spin-orbit coupling , which can only be explained 
by postulating that electrons possess spin. See Lecture Notes 6. 

• If we ignore spin, we expect to observe the normal Zeeman effect when an atom is placed in an 
external magnetic field. However, most atoms display the anomalous Zeeman effect, which is a 
consequence of spin. See Lecture Notes 7. 

• The ratio of the magnetic dipole moment to the angular momentum is called the gyromagnetic ratio. 
(See Section 6.1.) The gyromagnetic ratio can be measured directly by a number of methods. In 
1915, Einstein and de Haas measured the gyromagnetic ratio of iron and came up with a value twice 
as large as expected. They rejected this result, assigning it to experimental errors. However, we 
now know that the magnetism in iron is caused by the spin rather the orbital angular momentum, 
and so the experimental value was correct. (The electron spin g-factor is 2: see Section 6.2.) This 
is a salutary lesson from the history that even great physicists like Einstein and de Haas can get 
their error analysis wrong! 

5.3 Addition of angular momentum 

Having discovered that electrons have different types of angular momentum, the question now arises as 
to how we add them together. Let us suppose that C is the resultant of two angular momentum vectors 
A and B as shown in Fig. 5.1(a), so that: 

C = A + B. (5.7) 

We assume for the sake of simplicity that |A| > \B\. (The argument is unaffected if \A\ < \B\.) We 
define 6 as the angle between the two vectors, as shown in figure 5.1(a). 

In classical mechanics the angle 9 can take any value from 0° to 180°. Therefore, \C\ can take any 
value from {\A\ + \B\) to {\A\ — \B\). This is not the case in quantum mechanics, because the lengths of 
the angular momentum vectors must be quantized according to: 

|A| = ^A{A + l)h 
\B\ = ^B{B + l)h 

\C\ = ^C{C + l)h, (5.8) 
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where; A. B and C arc the qiiantum numbers. This makes it apparent that can only take specific values 
in quantum mechanics. The rule for working out the allowed values of C from the known values of A and 
B is as follows: 

C = A®B = {A + B),{A + B-l),---,\A-B\, (5.9) 

where the ® symbol indicates that wc are adding together angular momentum quantum numbers. Here 
are some examples of the rule given in eqn 5.9: 

• J = L + S,L = 2,,S = l: 

J = 3el, 3 + 1=4, |3-1|=2, therefore J = 4, 3, 2. 

• L = h+l2,li^ 2, I2 = 0: 

L = 2 e 0, 2 + = 2, |2 - 0| = 2, therefore L = 2. 

• S = si + S2,si = 1/2, S2 = 1/2: 

5= 1/2 1/2, 1/2 + 1/2 = 1, |l/2- 1/2| = 0, therefore S = 1,0. 

• J = Ji+j2, Ji =5/2, j2 =3/2: 

J = 5/2 © 3/2, 5/2 + 3/2 = 4, |5/2 - 3/2| = 1, therefore J = 4, 3, 2, 1. 



5.4 Spin-orbit coupling 

The orbital and spin angular momenta of electrons in atoms are not totally independent of each other, 
but interact through the spin-orbit interaction. Spin-orbit coupling and its effects are considered in 
detail in Lecture Notes 6, and at this stage we just need to know two basic things: 

1. Spin-orbit coupling derives from the interaction between the magnetic dipole due to spin and the 
magnetic field that the electron experiences due to its orbital motion. We can thus write the 
spin-orbit interaction in the form (see eqn 6.32): 

H = -Atspin • -Borbital Ocl ■ S , (5.10) 

since n^p^^ oc s and -Borbitai oc I. 

2. The spin-orbit interaction scales roughly as Z^. (See eqn 6.42.) It is therefore weak in light atoms, 

and stronger in heavy atoms. 

We introduce the spin-orbit interaction here because it is one of the mechanisms that is important in 
determining the angular momentum coupling schemes that apply in different atoms. 



5.5 Angular momentum coupling in single electron atoms 

If an atom has just a single electron, the addition of the orbital and spin angular momenta is relatively 

straightforward. The physical mechanism that couples the orbital and spin angular momenta together is 
the spin-orbit interaction, and the resultant total angular momentum vector j is defined by: 

j=l + s. (5.11) 

j is described by the quantum numbers j and mj according to the usual rules for quantum mechanical 
angular momenta, namely: 

|j| = \/j(j + l)fi, (5.12) 

and 

jz=mjh, (5.13) 

where rrij takes values of j, (j — 1), • • • , —j. The addition of / and s to form the resultant j is illustrated 
by Fig. 5.1(b).'' 

The allowed values of j are worked out by applying eqn 5.9, with the knowledge that the spin quantum 
number s is always equal to 1/2. If the electron is in a state with orbital quantum number I, wc then 
find j = l(Bs = {l±l/2), except when I = 0, in which case we just have j = 1/2. In the second case, the 
angular momentum of the atom arises purely from the electron spin. 



^Graphical representations of the type shown in Fig. 5.1 are called vector models. We shall encounter vector models 
again when we come to study the Zeeman effect in Lecture Notes 7. 
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5.6 Angular momentum coupling in multi-electron atoms 

The Hamiltonian for an A'^-electron atom can be written in the form: 

H = Ho + Hi+H2, (5.14) 



where; 



^0 = J2[-^'^1+Vcentr.liri)], (5.15) 

i=l ^ ^ 

= -El^+E ....|..._.„| -EWrO, (5.16) 



AT 

^2 = J2^{ri)li-Si. (5.17) 

As discussed in Section 4.1, Hq is the central-field Hamiltonian and Hi is the residual electrostatic 
potential. H2 is the spin-orbit interaction summed over the electrons of the atom. 

In Chapter 4 we neglected both Hi and H2, and just concentrated on Hq. This led to the conclusion 
that each electron occupies a state in a shell defined by the quantum numbers n and I. The reason why 
we neglected Hi is that the off-radial forces due to the electron-electron repulsion are smaller than the 
radial ones, while H2 was neglected because the spin-orbit effects arc much smaller than the main terms 
in the Hamiltonian. It is now time to study what happens when these two terms are included. In doing 
so, there are two obvious limits to consider:^ 

• LS coupling: Hi ^ H2- 

• jj coupling: > 

Since the spin-orbit interaction scales approximately as Z^, LS-coupling mainly occurs in atoms with 
small to medium Z, while jj-coupling occurs in some atoms with large Z. In the sections below, we focus 
on the LS-coupling limit. The less common case of jj-coupling is considered briefly in Section 5.10. 



5.7 LS coupling 

In the LS-coupling limit (alternatively called Russell— Saunders coupling), the residual electrostatic 
interaction is much stronger than the spin-orbit interaction. We therefore deal with the residual elec- 
trostatic interaction first and then apply the spin-orbit interaction as a perturbation. The LS coupling 
regime applies to most atoms of small and medium atomic number. 

Let us first discuss some issues of notation. We shall need to distinguish between the quantum 
numbers that refer to the individual electrons within an atom and the state of the atom as a whole. The 
convention is: 

• Lower case quantum numbers (j, I, s) refer to individual electrons within atoms. 

• Upper case quantum numbers (J, L and S) refer to the angular momentum states of the whole 
atom. 

For single electron atoms like hydrogen, there is no difference. However, in multi-electron atoms there 
is a real difference because we must distinguish between the angular momentum states of the individual 
electrons and the resultants which give the angular momentum states of the whole atom. 

We can use this notation to determine the angular momentum states that the LS-coupling scheme 
produces. The residual electrostatic interaction has the effect of coupling the orbital and spin angular 
momenta of the individual electrons together, so that we find their resultants according to: 

L = ^Zi, (5.18) 

i 

S = ^Si. (5.19) 



^In some atoms with medium-large Z (e.g. germanium Z = 32) we are in the awkward situation where neither Umit 
apphes. We then have intermediate coupling, and the behaviour is quite complicated to describe. 
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Filled shells of electrons have no net angular rnonientnrn. and so the summation only needs to be carried 
out over the valence electrons. In a many-electron atom, the rule given in eqn 5.9 usually allows several 
possible values of the quantum numbers L and S for a particular electronic configuration. Their energies 
will differ due to the residual electrostatic interaction. The atomic states defined by the values of L and 
S are called terms. 

For each atomic term, we can find the total angular momentum of the whole atom from: 

J = L + S. (5.20) 

The values of J, the quantum number corresponding to J, are found from L and S according to eqn 5.9. 
The states of different J for each LS-term have different energies due to the spin-orbit interaction. In 
analogy with eqn 5.10, the spin-orbit interaction of the whole atom is written: 

A£,o <x -H%T ■ BfrCnai <x i • S , (5.21) 

where the 'atom' superscript indicates that we take the resultant values for the whole atom. The details 
of the spin-orbit interaction in the LS coupling limit are considered in Section 6.6. At this stage, all we 
need to know is that the spin-orbit interaction splits the LS terms into levels labelled by J. 

It is convenient to introduce a shorthand notation to label the energy levels that occur in the LS 
coupling regime. Each level is labelled by the quantum numbers J, L and S and is represented in the 
form: 

The factors (25* -I- 1) and J appear as numbers, whereas L is a letter that follows the rule: 

• S implies L = 0, 

• P implies L — 1, 

• D implies L = 2, 

• F implies L = 3, etc. 

Thus, for example, a term is the energy level with quantum numbers S = 1/2, L = 1, and J = 1/2, 

while a "^Ds has S = 1, L = 2 and J = 3. The factor of (25-1-1) in the top left is called the multiplicity. 
It indicates the degeneracy of the level due to the spin: i.e. the number of Ms states available. If S = 0, 
the multiplicity is 1, and the terms are called singlets. If 5 = 1/2, the multiplicity is 2 and we have 
doublet terms. If 5* = 1 we have triplet terms, etc. 

As an example, consider the (3s, 3p) electronic configuration of magnesium, where one of the valence 
electrons is in an s-shell with I = and the other is in a p-shell with 1 = 1. We first work out the LS 
terms: 

• L = h®l2=0®l = l. 

• S = si © S2 = 1/2 ® 1/2 = 1 or 0. 

We thus have two terms: a triplet and a ^P singlet. The allowed levels are then worked out as follows: 

• For the ^p triplet, we have J = L 8 5' = 1 8 1 = 2, 1, or 0. We thus have three levels: ^P2, ^Pi, 
and 3Po. 

• For the -^P singlet, we have J = L®S'=1©0=:1. We thus have a single -^Pi level. 

These levels are illustrated in Fig. 5.2. The ordering of the energy states should not concern us at this 
stage. The main point to realize is the general way the states split as the new interactions are turned on, 
and the terminology used to designate the states. 

5.8 Electric dipole selection rules in the LS coupling limit 

When considering electric-dipole transitions between the states of many-electron atoms that have LS- 
coupling, a single electron makes a jump from one atomic shell to a new one. The rules that apply to 
this electron are the same as the ones discussed in Section 3.4. However, we also have to think about the 
angular momentum state of the whole atom as specified by the quantum numbers (L, S, J). The rules 
that emerge are as follows: 



5.9. HUND'S RULES 



47 



^ 

/ 
/ 
/ 

/ 

3p 

// 
/ / 
/ / 

3s3p / / 

configuration >==> terms levels 

residual spin-orbit 
electrostatic coupling 
interaction 

Figure 5.2: Splitting of the energy levels for the (3s, 3p) configuration of magnesium in the LS coupling 
regime. 

1. The parity of the wave function must change. 

2. Al = ±1 for the electron that jumps between shells. 

3. AL = 0, ±1, but L = ^ is forbidden.^ 

4. A J = 0, ±1, but J = ^ is forbidden. 

5. AS^O. 

Rule 1 follows from the odd parity of the dipole operator. Rule 2 applies the A/ = ±1 single-electron 
rule to the individual electron that makes the jump in the transition, while Rule 3 applies Rule 2 to the 
resultant orbital angular momentum of the whole atom according to the rules for addition of angular 
momenta. Rule 4 follows from the fact that the total angular momentum must be conserved in the 
transition, allowing us to write: 

^initial jiinal ^ ^photon 

The photon carries one unit of angular momentum, and so we conclude from eqn 5.9 that AJ = 
— 1, 0, or +1. However, the AJ = rule cannot be applied to J = — 7> transitions because it is 
not possible to satisfy eqn 5.22 in these circumstances. Finally, rule 5 is a consequence of the fact that 
the photon does not interact with the spin.^ 



1 








5.9 Hund's rules 

We have seen above that there are many terms in the energy spectrum of a multi-electron atom. Of these, 
one will have the lowest energy, and will form the ground state. All the others are excited states. Each 
atom has a unique ground state, which is determined by minimizing the energy of its valence electrons 
with the residual electrostatic and spin-orbit interactions included. In principle, this is a very complicated 
calculation. Fortunately, however, Hund's rules allow us to determine which level is the ground state 
for atoms that have LS-coupling without lengthy calculation. The rules are: 

1. The term with the largest multiplicity (i.e. largest S) has the lowest energy. 

2. For a given multiplicity, the term with the largest L has the lowest energy. 

3. The level with J = |L — S"! has the lowest energy if the shell is less than half full. If the shell is 
more than half full, the level with J = L + S has the lowest energy. 

^AL = transitions are obviously forbidden in one-electron atoms, because L = I and I must change. However, in atoms 
with more than one valence electron, it is possible to get transitions between different configurations that satisfy rule 2, but 
have the same value of L. An example is the allowed 3p3p''Pi —> 3p4s''P2 transition in silicon at 250.6 nm. 

'''AS ^ transitions can be weakly allowed when the spin-orbit coupling is strong, because the spin is then mixed with 
the orbital motion. 
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The first of tlicsc rules basically tells us that the electrons try to align themselves with their spins parallel 
in order to minimize the exchange interaction. (See Chapter 8.) The other two follow from the minimizing 
the spin-orbit interaction. 

Let us have a look at carbon as an example. Carbon has an atomic number Z ~ 6 with two valence 
electrons in the outermost 2p shell. Each valence electron therefore has / = 1 and s = 1/2. Consider first 
the (2p,np) excited state configuration with one electron in the 2p shell and the other in the np shell, 
where n > 3. We have from eqn 5.9 that L = 1 ® 1 = 0, 1 or 2, and 5 = 1/2 © 1/2 = or 1. We thus 
have three singlet terms (^S, ^P, ^D), and three triplet terms ("^S, '^P, '^D). This gives rise to three singlet 
levels: 

^So, ^Pi, , 

and seven triplet levels: 

^Si, 3po, 3p^^ 3p^^ 3j3^^ 3j3^^ 3d^_ 

We thus have a confusing array of ten levels in the energy spectrum for the (2p,np) configuration. 

The situation in the ground state configuration (2p,2p) is simplified by the fact that the electrons are 
equivalent, i.e. in the same shell. The Pauli exclusion principle forbids the possibility that two or more; 
electrons should have the same set of quantum numbers, and this means that L + S must be equal to 
an even number. There is no easy explanation for this rule, but the simplest example of its application, 
namely to two electrons in the same s-shell, is considered in Section 8.3. For these two s-electrons, we 
have L = © = and S = 1/2 © 1/2 = or 1, giving rise to two terms: ^S and ^S. Both terms are 
allowed when the electrons are in different s-shells, but the L + S rule tells us that only the singlet ^S 
term is allowed if the ele(;trons arc in the same s-shcll. The proof that the triplet term does not exist for 
the (Is, Is) ground-state configuration of helium is given in Section 8.3. 

On applying the rule that L + S must be even to the equivalent 2p electrons in the carbon ground 
state, wo find that only the ^S, ^D, and ^P terms are allowed, which means that only five of the ten levels 
listed above are possible:* 

^So, 3Po, 'Pi, 'P2. 

We can now apply Hund's rules to find out which of these is the ground state. The first rule states that 
the triplet levels have the lower energy. Since these all have L = 1 we do not need to consider the second 
rule. The shell is less than half full, and so we have J = \L — S\ = 0. The ground state is thus the ^Pq 
level. All the other levels are excited states. 

It is important to notice that, if we had forgotten the rule that L + S must be even, we would have 
incorrectly concluded from Hund's rules that the ground state is a '^Di term, which does not exist for the 
(2p,2p) configuration. It is therefore safer to use a different version of Hund's rules, based on the allowed 
combinations of {ms,mi) sub-levels: 

1. Maximize the spin and set 5 = ^mg. 

2. Maximize the orbital angular momentum, subject to rule 1, and set L = Y^mi. 

3. J =\L — S\ if the shell is less than half full, otherwise J = |L -|- 5|. 

These rules should work in all cases, since they incorporate the Pauli exclusion principle properly. 

As an example of how to use the second version of Hund's rules, we apply them again to the two 2p 
electrons of carbon. The two electrons can go into the six possible (m^, m;) sub-levels of the 2p shell. 

1. To get the largest value of the spin, we must have both electron spins aligned with = -1-1/2. 

This gives S* = 1/2 + 1/2 = 1. 

2. Having put both electrons into spin up states, we cannot now put both electrons into mi = +1 
states because of Pauli's exclusion principle. The best we can do is to put one into an m; = 1 state 
and the other into an m; = state, as illustrated in Table 5.1. This gives L = 1 -|- = 1. 

3. The shell is less than half full, and so we have J =\L — S\=Q. 

We thus deduce that the ground state is the ^Pq level, as before. 

The ground state levels for the first 11 elements, as worked out from Hund's rules, arc listed in 
Table 5.2. Experimental results confirm these predictions. Note that full shells always give ^Sq level with 
no net angular momentum: S = L = J = 0. 



*The full derivation of the allowed states for the (np,np) configuration of a group IV atom is considered, for example, 
in Woodgate, Elementary Atomic Structure, 2nd Edition, Oxford University Press, 1980, Section 7.2. 
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Table 5.1: Distribution of the two valence electrons of the carbon ground state within the rus and m; 
states of the 2p shell. 



Z Element Configuration Ground state 



1 


H 


Isi 








2 


He 


ls2 








3 


Li 


ls2 


2si 






4 


Be 


ls2 


2s2 






5 


B 


ls2 


2s2 




^Pl/2 


6 


C 


ls2 


2s2 


2p^ 


^Po 


7 


N 


ls2 


2s2 


2p3 


^^3/2 


8 





ls2 


2s2 


2p4 


^P2 


9 


F 


ls2 


2s2 


2p5 


'P3/2 


10 


Ne 


ls2 


2s2 


2p6 


^So 


11 


Na 


ls2 


2s2 


2p'^ 3si 


^Si/2 



Table 5.2: Electronic configurations and ground state terms of the first 11 elements in the periodic table. 

It is important to be aware that Hund's rules cannot be used to find the energy ordering of excited 
states with reliability. For example, consider the (2p,3p) excited state configuration of carbon, which has 
the ten possible levels listed previously. Hund's rules predict that the ^Di level has the lowest energy, 
but the lowest state is actually the ^Pi level. 

5.10 jj coupling 

The spin-orbit interaction gets larger as Z increases. (See, for example, eqn 6.42.) This means that in 
some atoms with large Z (eg tin with Z = 50) we can have a situation in which the spin-orbit interaction 
is much stronger than the residual electrostatic interaction. In this regime, jj coupling coupling occurs. 
The spin-orbit interaction couples the orbital and spin angular momenta of the individual electrons 
together first, and we then find the resultant J for the whole atom by adding together the individual js: 

ji = h + Si 

N 

J = Y.ji (5-23) 

i=l 

These J states are then split by the weaker residual electrostatic potential, which acts as a perturbation. 
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Chapter 6 

Fine structure 



In the previous five sets of notes we have been mainly studying the gross structure of atoms. When we 
consider the gross structure, we include only the largest interaction terms in the Hamiltonian, namely, 
the electron kinetic energy, the electron-nuclear attraction, and the electron-electron repulsion. 

It is now time to start considering the smaller interactions in the atom that arise from magnetic 
effects. In this chapter we shall consider only those effects caused by internal magnetic fields, leaving 
the discussion of the effects produced by external fields to the next set of notes. The internal fields 
within atoms cause fine structure in atomic spectra. We shall start by considering the fine structure of 
hydrogen and then move on to many-electron atoms. At the end of these notes we shall also look briefly 
at hyperfine structure, which is a similar, but smaller, effect due to the magnetic interactions between 
the electrons and the nucleus. 



6.1 Orbital magnetic dipoles 

The quantum numbers n and I were first introduced in the old quantum theory of Bohr and Sommerfeld. 
The principal quantum number n was introduced in the Bohr model as a fundamental postulate con- 
cerning the quantization of the angular momentum (see eqn 1.5), while the orbital quantum number 
I was introduced a few years later by Sommerfeld as a patch-up to account for the possibility that the 
atomic orbits might be elliptical rather than circular. In Lecture Notes 2, we saw how these two quantum 
numbers naturally re-appear in the full quantum mechanical treatment of the hydrogen atom. Then, in 
Section 4.1. we saw how they carry across to many-electron atoms. 

Two key results that drop out of the quantiim mechanical treatment of atoms are: 

• The magnitude L of the orbital angular momentum of an electron is given by (see eqn 2.32): 

L=^l{l + l)h, (6.1) 
where I can take integer values up to (n — 1). 

• The component of the angular momentum along a particular axis (usually taken as the z axis) is 
quantized in units of H and its value is given by (see eqn 2.33): 

Lz = mih , (6.2) 

where the magnetic quantum number m; can take integer values from — / to +/. 

These two relationships give rise to the vector model of angular momentum illustrated in Fig. 2.1. 

The orbital motion of the electron caiises it to have a magnetic moment. Let us first consider an 
electron in a circular Bohr orbit, as illustrated in Fig. 6.1(a). The electron orbit is equivalent to a current 
loop, and we know from electromagnetism that current loops behave like magnets. The electron in the 
Bohr orbit is equivalent to a little magnet with a magnetic dipole moment given by: 

H = ix Area = -(e/T) x (7rr^) , (6.3) 

where T is the period of the orbit. Now T = 2-Kr/v, and so we obtain 

ev o e e ^ ,„ 

11 = -— -KT =-- — mevr = -- — L, (6.4) 
27rr zrrie 2me 
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Figure 6.1: (a) The orbital motion of the electron around the nucleus in a circular Bohr orbit is equivalent 
to a current loop, which generates a magnetic dipole moment, (b) Magnetic dipole moment of an electron 
in a non-circular orbit. 



where we have substituted L for the orbital angular momentum m^vr. 

This relationship can easily be generalized to the case of electrons in non-circular orbits. Consider 
an electron at position vector r in a non-circular orbit with an origin O. The magnetic dipole moment is 
given by: 

/X = i idA, (6.5) 



where i is the current in the loop and dA is the incremental area swept out by the electron as it performs 
its orbit. The incremental area dA is related to the path element Au by: 

dA = i r X du , (6.6) 

and so eqn 6.5 becomes: 

fi = - (j) ir X du . (6.7) 

We can write the current as z = dq/dt, where q is the charge, which implies: 

n — - (p — r X du , 

1 
2 



dqr X V , 

dqr X p , (6.8) 
where v is the velocity, and p is the momentum. The angular momentum is defined as usual by 



1 

2mp 



L = r X p (6.9) 

and so we finally obtain: 

H= — (f Ldq= —L (f dq= —L{-e) , (6.10) 
2mo J 2mo J 2mc 

as in eqn 6.4. Note that the result works because the angular momentum X is a constant of the motion 
in the central field approximation (see Section 5.2.1), and so it can be taken out of the integral. 

Equation 6.4 shows us that the orbital angular momentum is directly related to the magnetic dipole 
moment. The quantity e/2me that appears is called the gyromagnetic ratio. It specifies the propor- 
tionality constant between the angular momentum of an electron and its magnetic moment. It is apparent 
from eqns 6.1 and 6.4 that the magnitude of atomic magnetic dipoles is given by: 

= ^ny/i{[Ti) = fiBVW+^) , (6.11) 

2 mo 

where /j,b is the Bohr magneton defined by: 

eh 

^iB = 7^ — = 9.27 X 10^24 jrp-i^ (g -^2) 

2mp 
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This shows that the size of atomic dipolcs is of order /ib- In many cases we are interested in the z 
component of the magnetic dipole, which is given from eqns 6.2 and 6.4 as: 

= = -/J-BTni , (6.13) 
where m; is the orbital magnetic quantum number. 



6.2 Spin magnetism 

We have seen in Section 5.2.2 that electrons also have spin angular momentum. The deflections measured 

in the Stern-Gcrlach experiment (see Fig. 1.4) enabled the magnitude of the magnetic moment due to 
the spin angular momentum to be determined. The component along the z axis was found to obey: 

= -gsl^BrUs , (6-14) 

where gs is the g- value of the electron, and rrig = ±1/2 is the magnetic quantum number due to spin. 
This is identical in form to eqn 6.13 apart from the factor of Qg. The experimental value of gs was 
found to be close to 2. The Dirac equation predicts that should be exactly equal to 2, and more 
recent calculations based on quantum electrodynamics (QED) give a value of 2.0023192, which agrees 
very accurately with the most precise experimental data. 



6.3 Spin-orbit coupling 

The fact that electrons in atoms have both orbital and spin angular momentum leads to a new interaction 
term in the Hamiltonian called spin-orbit coupling. Sophisticated theories of spin-orbit coupling (e.g. 
those based on the Dirac equation) indicate that it is actually a relativistic effect. At this stage it is more 
useful to consider spin-orbit coupling as the interaction between the magnetic fleld due to the orbital 
motion of the electron and the magnetic moment due to its spin. This more intuitive approach is the one 
we adopt here. We start by giving a simple order of magnitude estimate based on the semi-classical Bohr 
model, and then take a more general approach that works for the fully quantum mechanical picture. 



6.3.1 Spin-orbit coupling in the Bohr model 

The easiest way to understand the spin-orbit coupling is to consider the single electron of a hydrogen atom 
in a Bohr-like circular orbit around the nucleus, and then shift the origin to the electron, as indicated in 
Fig. 6.2. In this frame, the electron is stationary and the nucleus is moving in a circular orbit of radius 
r„. The orbit of the micleus is equivalent to a current loop, which produces a magnetic field at the origin. 
Now the magnetic field produced by a circular loop of radius r carrying a current i is given by: 

^. = 1?, (6-15) 

where z is taken to be the direction perpendicular to the loop. As in Section 6.1, the current i is given 
by the charge Ze divided by the orbital period T = 2wr/v. On substituting for the velocity and radius 
in the Bohr model from eqns 1.15 and 1.16, we find: 

74 



where a = e^/2eohc « 1/137 is the fine structure constant defined in eqn 1.18. For hydrogen with 

Z ^ n ^ 1, this gives « 12 Tesla, which is a large field. 

The electron at the origin experiences this orbital field and we thus have a magnetic interaction energy 
of the form: 

^■^so = -/^spin ■ -Borbital , (6-1''') 

which, from eqn 6.14, becomes: 

AEso = gsHBirisB^ = ±^bB^ , (6.18) 

where we have used gg = 2 and mg = ±1/2 in the last equality. By substituting from eqn. 6.16 and 
making use of eqn 6.12, we find: 

Z-^^ fioace'^h _ ^2^2 
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Figure 6.2: An electron moving with velocity v through the electric field £ of the nucleus experiences a 
magnetic field equal to [Sy,v) j c? . The magnetic field can be understood by shifting the origin to the 
electron and calculating the magnetic field due to the orbital motion of the nucleus around the electron. 
The velocity of the nucleus in this frame is equal to —v. 



where i?„ is the quantized energy given by eqn 1.10. For the n — \ orbit of hydrogen, this gives: 

|A£;,„| = a^R^ = 13.6eV/1372 = 0.7meV = Qcm-^ . 

This shows that the spin-orbit interaction is about 10^ times smaller than the gross structure energy 
in hydrogen. Note that the relative size of the spin-orbit interaction grows as Z^, so that spin-orbit 
effects are expected to become more important in heavier atoms, which is indeed the case. Note also that 
eqn 6.19 can be re- written using eqn 1.16 as 




(6.20) 



which shows that the spin-orbit interaction energy is of the same magnitude as the relativistic corrections 
that would be expected for the Bohr model. This is hardly surprising, given that Dirac tells us that we 
should really think of spin-orbit coupling as a relativistic effect. 



6.3.2 Spin-orbit coupling beyond the Bohr model 

In this sub-section we repeat the calculation above but without making use of the semi-classical results 
from the Bohr model. The electrons experience a magnetic field as they move through the electric field of 
the nucleus. If the electron velocity is v, it will see the nucleus orbiting around it with a velocity of — 1>, 
as shown in Fig. 6.2. The magnetic field generated at the electron can be calculated by the Biot-Savart 
law as shown by Fig. 6.3. This gives the magnetic field at the origin of a loop carrying a current i as: 

^0 / . dtt X r 



47r /loop r'^ 

where du is an orbital path element. For simplicity we consider the case of a circular orbit with constant 
r. In this case we have: 

/ du 

i du = (p — dtt — Ze—- = Ze(—v) . 
J dt dt ^ ' 

We thus obtain: 

„ /in Ze IJ-o Ze , . 

B = -^^v X r = X V . (6.22 

For a Coulomb field the electric field £ is given by: 

Ze ^ Ze 

r, (6.23) 



AneQr^ Aireor^ 

where the hat symbol on r in the first equality indicates that it is a unit vector. On combining equations 
6.22 and 6.23 we obtain: 

B ^ liQeaSxv . (6.24) 
We know from Maxwell's equations that /^oco — l/c'^i a-nd so we can re- write this as: 

B = ^Sxv. (6.25) 

c/ 
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du 



O 



Figure 6.3: The magnetic field at the origin O due to a loop carrying a current i is calculated by the 
Biot-Savart law given in Eq. 6.21. The field points out of the paper. 



The same formula can also be derived for the more general case of non-circular orbits and non-Coulombic 
electric fields such as those found in multi-electron atoms. 
The spin-orbit interaction energy is given by: 

AEso — — /Xgpijj'Borbital , (6.26) 

where /Xgpin is the magnetic moment due to spin, which is given by: 

Mspin = ~9s7) — « = -9s-r-s . (6.27) 

On substituting Eqs. 6.25 and 6.27 into Eq. 6.26, we obtain: 

AE,, = ^s.i£xv). (6.28) 

If we have a central field (ie the potential V is a function of r only), we can write: ""^ 

1 r dV" 

5 = --^. (6.29) 
e r dr 

On making use of this, the spin-orbit energy becomes: 

where we have substituted v = p/rric. On recalling that the angular momentum I is defined as r X p, 
we find: 

gsUB f IdV 



AE,, = -^f^[~ s-l. (6.31) 

This calculation of AEgo does not take proper account of relativistic effects. In particular, we moved the 
origin from the nucleus to the electron, which is not really allowed because the electron is accelerating 
all the time and is therefore not an inertial frame. The translation to a rotating frame gives rise to an 
extra effect called the Thomas precession which reduces the energy by a factor of 2. (See Eisberg and 
Resnick, Appendix O.) On taking the Thomas precession into account, and recalling that /^b — eh/2ma, 
we obtain the final result: 

^° 2 2c^m^ (^r dr ) ^ 

This is the same as the result derived from the Dirac equation, except that gg is exactly equal to 2 in 
Dirac's theory. Equation 6.32 shows that the spin and orbital angular momenta are coupled together. If 
we have a simple Coulomb field and take gs — 2, we find 

Ze^ ( 1 



We can use this formula for hydrogenic atoms, while we can use the more general form given in Eq. 6.32 for 
more complicated multi-electron atoms where the potential will differ from the Coulombic 1 jr dependence 
due to the repulsion between the electrons. 



^It is easy to verify that this works for a Coulomb field where V = — Ze^/47reo^ and £ is given by eqn 6.23. 



56 



CHAPTER 6. FINE STRUCTURE 



/ =l,i = 3/2 
\^ 

n = 2 /l = Q,j=m \ 

V 

/ = 0,1 \ 



7 = 3/2 
0.365 cm- 



l=\,j=\l2 

Gross 



1 = I 

0.035 cm- 



structure 



j=\l2 " i=x t 

+ spin-orbit + Dirac theory + Lamb shift 



Figure 6.4: Fine structure in the n — 2 level of hydrogen. 

6.4 Evaluation of the spin-orbit energy for hydrogen 

The magnitude of the spin-orbit energy can be calculated from eqn 6.32 as: 



2c^m^ \r dr ^ 

where we have taken gs — 2, and the (• • • ) notation indicates that we take expectation values: 

""""^^ ^ ^ ^ i^nim (--f-) i'nimr'^siTLeArd9d4>. (6.35) 



r dr / JJJ \r dr J 

The function {dV/dr)/r depends only on r, and so we are left to calculate an integral over r only: 



r dr I Jq \r dr J 

where Rni{r) is the radial wave function. This integral can be evaluated exactly for the case of the 
Coulomb field in hydrogen where (dV^/dr)/r oc and the radial wave functions are known exactly. 

(See Table 2.2.) We then have, for I > 1: 

/ 1 \ ^ .fi07^ 
rdr/°'\r3/ a^nHil + + 1) ' ^ ^ 

This shows that we can re-write eqn 6.34 in the form: 

AEso - Cr^i (l-s), (6.38) 

where Cni is a constant that depends only on n and I. 

We can evaluate {I • s) by realizing from eqn 5.11 that we must have: 

f = {l + s)^ =f + + 21- s. (6.39) 

This implies that: 

{l.s) = -e- s^)^ = + 1) - 1(1 + 1) - .(. + 1)] . (6.40) 

We therefore find: 

Ai?so = [jU + 1) - 1(1 + 1) - sis + 1)] , (6.41) 
where C'^i — Cnifi^/2. On using eqn 6.37 we obtain the final result for states with / > 1: 

= -"^^^J^T^T^^iji^ + 1) - + 1) - sis + 1)] , (6.42) 

where a w 1/137 is the fine structure constant, and En = —R-nZ^/n^ is equal to the gross energy. 
For states with Z = it is apparent from eqn 6.34 that AE'so = 0. 

The fact that j takes values of Z -f 1/2 and Z — 1/2 for I > 1 means that the spin-orbit interaction splits 
the two j states with the same value of /. We thus expect the electronic states of hydrogen with Z > 1 to 
split into doublets. However, the actual fine structure of hydrogen is more complicated for two reasons: 



6.5. SPIN-ORBIT COUPLING IN ALKALI ATOMS 



57 



(a) 



n, I 



J=(L + V2) 



CL 



C{L+l) 



J={L-V2) 




levels 



Figure 6.5: Spin-orbit interactions in alkali atoms, (a) The spin-orbit interaction splits the nl states into 
a doublet if / 7^ 0. (b) Fine structure in the yellow sodium D lines. 



1. States with the same n but different / are degenerate. 

2. The spin-orbit interaction is small. 

The first point is a general property of pure one-electron systems, and the second follows from the scaling 
of AEgo/En with Z"^. A consequence of point 2 is that other relativistic effects that have been neglected 
up until now are of a similar magnitude to the spin-orbit coupling. In atoms with higher values of Z, the 
spin-orbit coupling is the dominant relativistic correction, and wc can neglect the other effects. 

The fine structure of the n — 2 level in hydrogen is illustrated in figure 6.4. The fully relativistic Dirac 
theory predicts that states with the same j are degenerate. The degeneracy of the two j = 1/2 states is 
ultimately lifted by a quantum electrodynamic (QED) effect called the Lamb shift. The complications of 
the fine structure of hydrogen due to other relativistic and QED effects means that hydrogen is not the 
paradigm for understanding spin-orbit effects. The alkali metals considered below are in fact simpler to 
understand. 



6.5 Spin-orbit coupling in alkali atoms 



Alkali atoms have a single valence electron outside close shells. Closed shells have no angular momentum, 
and so the angular momentum state \L, S, J) of the atom is determined entirely by the valence electron. 
By analogy with the results for hydrogen given in eqns 6.38 and eqn 6.41, we can write the spin-orbit 
interaction term as: 

AEso oc (L • cx [J( J + 1) - L{L + 1) - S{S + 1)] . (6.43) 

It follows immediately that the spin-orbit energy is zero when the valence electron is in an s-shell, since 
L- S = when L = 0. (Alternatively: J = S if L = 0, so J{J + 1) - L{L + 1) - S{S + 1) - 0.) 

Now consider the case when the valence electron is in a shell with / 0. We now have L — / and 
S ^ 1/2, so that L • S ^ 0. J has two possible values, namely J = L®S' = Lel/2 = i±l/2. On 
writing eqn 6.43 in the form: 



AEso = C [J{J + 1) - L{L + 1) - Sis + 1)] 
the spin-orbit energy of the J = {L + 1/2) state is given by: 



(6.44) 



: C 



\L+l)iL+l)-LiL + l)-l.l 



-■+CL, 



while for the J = [L — 1/2) level we have: 



AE,^ = C 



[L-\)[L- 



1 



)-L{L + l)- 



1 



= -C(L+l). 



Hence the term defined by the quantum numbers n and I is split by the spin-orbit coupling into two 
new states, as illustrated in figure 6.5(a). This gives rise to the appearance of doublets in the atomic 
spectra. The magnitude of the splitting is smaller than the gross energy by a factor ^ 
(See Eq. 6.42.) This is why these effects are called "fine structure", and a is called the 
constant" . 

As an example, let us consider sodium, which has 11 electrons, with one valence electron outside filled 
Is, 2s and 2p shells. It can therefore be treated as a one electron system, provided we remember that 



a2 = 1/137^ 
■fine structure 
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this is only an approximation. One immediate consciquence is that the differing I states arising from the 
same value of n are not degenerate as they are in hydrogen. (See section 4.5.) The bright yellow D lines 
of sodium correspond to the 3p — >■ 3s transition. 

It is well known that the D-lincs actually consist of a doublet, as shown in Fig. 6.5(b). The doublet 
arises from the spin-orbit coupling. The ground state is a level with zero spin-orbit splitting. The 

excited state is split into the two levels derived from the different J values for i = 1 and 5 = 1/2, namely 
the ^P3/2 and levels. The two transitions in the doublet are therefore: 

and 

The energy difference of 17cm~^ between them arises from the spin-orbit splitting of the two J states of 
the ^P term. 

Similar arguments can be applied to the other alkali elements. The spin-orbit energy splittings of 
their first excited states are tabulated in Tabic 6.1. Note that the splitting increases with Z, and that 
the splitting energy is roughly proportional to Z^, as shown in Fig. 6.6. This is an example of the fact 
that spin-orbit interactions generally increase with the atomic number, so that the spin-orbit coupling 
is stronger in heavier elements. 



Element 


Z 


Ground state 


1st excited state 


Transition 


AE (cm-i) 


Lithium 


3 


[He] 2s 


2p 


2p ^2s 


0.33 


Sodium 


11 


[Ne] 3s 


3p 


3p — >3s 


17 


Potassium 


19 


[Ar] 4s 


4p 


4p -;>4s 


58 


Rubidium 


37 


[Kr] 5s 


5p 


5p — )-5s 


238 


Cesium 


55 


[Xc] 6s 


6p 


6p — 7-6s 


554 



Table 6.1: Spin-orbit splitting AE of the D lines of the alkali elements. The energy splitting is equal to 
the difference of the energies of the J = 3/2 and J = 1/2 levels of the first excited state. 



6.6 Spin-orbit coupling in many-electron atoms 

We have seen in Chapter 5 that atoms with more than one valence electron can have different types of 
angular momentum coupling. We restrict our attention here to atoms with LS-coupling, which is the 
most common type, as explained in Section 5.7. In LS-coupling, the residual electrostatic interaction 
couples the orbital and spin angular momenta together according to eqns 5.18 and 5.19. The resultants 
are then coupled together to give the total angular momentum J according to 

J = L + S. (6.45) 

The rules for coupling of angular momenta produce several J states for each LS'-term, with J running 
from L + S down to \L — S\ in integer steps. ^ These J states experience different spin-orbit interactions, 
and so are shifted in energy from each other. Hence the spin-orbit coupling splits the J states of a 
particular Z/5-term into fine structure multiplets. 

The splitting of the J states can be evaluated as follows. The spin-orbit interaction takes the form: 

AE,^ = -/x,pi„ • Bo,bitai cx (L • S) , (6.46) 

which implies (cf. eqns 6.38 - 6.41): 

AEso = Cls [J{J + 1) - L{L + 1) - S(S + 1)] . (6.47) 

It follows from eqn 6.47 that levels with the same L and S but different J are separated by an energy 
which is proportional to J. This is called the interval rule. Figure 5.2 shows an example of the interval 
rule for the ^P term of the (3s, 3p) configuration of magnesium. 



^There is only one J state, and hence no fine structure splitting, when one or both of L or 5 are zero. 



6. 7. NUCLEAR EFFECTS IN ATOMS 



59 




Figure 6.6: Spin-orbit splitting of the first excited state of the alkali atoms versus Z'^, as determined by 
the fine structure splitting of the D-lines. (See Table 6.1.) 

6.7 Nuclear effects in atoms 

For most of the time in atomic physics we just take the nucleus to be a heavy charged particle sitting at 
the centre of the atom. However, careful analysis of the spectral lines can reveal small effects that give 
us direct information about the nucleus. The main effects that can be observed generally fall into two 
categories, namely isotope shifts and hyperfine structure. 

6.7.1 Isotope shifts 

There are two main processes that give rise to isotope shifts in atoms, namely mass effects and field 
effects. 

Mass effects The mass m that enters the Schrodinger equation is the reduced mass, not the bare electron 
mass me (cf. eqn 1.9). Changes in the nuclear mass therefore make small changes to m and hence 
to the atomic energies. 

Field efTects Electrons in s shells have a finite probability of penetrating the nucleus, and are therefore 
sensitive to its charge distribution. 

Both effects cause small shifts in the wavelengths of the spectral lines from different isotopes of the same 
element. The heavy isotope of hydrogen, namely deuterium, was discovered in this way through its mass 
effect. 

6.7.2 Hyperfine structure 

In high-resolution spectroscopy, it is necessary to consider effects relating to the magnetic interaction 
between the electron angular momentum (J) and the nuclear spin (/). The angular momentum of the 
electrons creates a magnetic field at the nucleus which is proportional to J. The spin of the nucleus gives 
it a magnetic dipole moment which is proportional to /, and we therefore have an interaction energy 
term of the form: 

Ail^hyporfinc — " /^nucleus ' -^electron OC (J • J) . (6.48) 
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Figure 6.7: (a) Hyperfine structure of the Is ground state of hydrogen. The arrows indicate the relative 
directions of the electron and nuclear spin, (b) Hyperfine transitions for the sodium Di line, (c) Hyperfine 
transitions for the sodium D2 line. Note that the hyperfine splittings are not drawn to scale. The 
splittings of the sodium levels are as follows: ^Sj/a, 1772 MHz; 2Pi/2, 190 MHz; 2P3/2 (3^2), 59 MHz; 
2P3/2 (2 ^ 1), 34 MHz; (1 ^ 0), 16 MHz. 



This gives rise to hyperfine splittings in the atomic terms. The magnitude of the splittings is very small 
because the nuclear dipole is about 2000 times smaller than that of the electron. This follows from the 
small gyromagnetic ratio of the nucleus, which is inversely proportional to its mass. (See eqn 6.4.) The 
splittings are therefore about three orders of magnitude smaller than the fine structure splittings: hence 
the name "hyperfine" . 

Hyperfine states are labelled by the total angular momentum F of the whole atom (i.e. nucleus plus 
electrons), where 

F = I + J. (6.49) 

In analogy with the \LSJ) states of fine structure, the electric dipole selection rule for transitions between 
hyperfine states is: 

AF = 0,±1, (6.50) 

with the exception that = — > transitions are forbidden. Let us consider two examples to see how 
this works. 

The hydrogen 21 cm Une 

Consider the ground state of hydrogen. The nucleus consists of just a single proton, and we therefore have 
/ — 1/2. The hydrogen ground state is the Is term, which has J = 1/2. The hyperfine quantum 

number F is then found from F — I (B J — 1/2 01/2 = 1 orO. These two hyperfine states correspond to 
the cases in which the spins of the electron and the nucleus are aligned parallel {F = 1) or antiparallel 
{F = 0). The two F states are split by the hyperfine interaction by 0.0475 cm~^ (5.9 x 10~^ eV). (See 
Fig. 6.7(a).) Transitions between these levels occur at 1420 MHz (A — 21cm), and are very important 
in radio astronomy. Radio frequency transitions such as these are also routinely exploited in nuclear 
magnetic resonance (NMR) spectroscopy. 

Hyperfine structure of the sodium D lines 

The sodium D lines originate from 3p — > 3s transitions. As discussed in Section 6.5, there are two lines 
with energies split by the spin-orbit coupling, as indicated in Fig. 6.5(b). 

Consider first the lower energy Di line, which is the ^Pi/2 — >^ S1/2 transition. The nucleus of sodium 
has / = 3/2, and so we have F = 3/2© 1/2 = 2 or 1 for both the upper and lower levels of the transition, 
as shown in Fig. 6.7(b). Note that the hyperfine splittings are not drawn to scale in Fig. 6.7(b): the 
splitting of the ^Si/2 level is 1772 MHz, which is much larger than that of the ■^Pi/2, namely 190 MHz. 
This is a consequence of the fact that s-electrons have higher probability densities at the nucleus, and 
hence experience stronger hyperfine interactions. All four transitions are allowed by the selection rules, 
and so we observe four lines. Since the splitting of the upper and lower levels are so different, we obtain 
two doublets with relative frequencies of (0, 190) MHz and (1772, 1962) MHz. These splittings should be 
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compared to the much larger ('^ 5 x 10^^ Hz) sphtting between the two J states caused by the spin-orbit 
interaction. Since the hyperfine sphttings are much smaller, they are not routinely observed in optical 
spectroscopy, and specialized techniques using narrow band lasers are typically employed nowadays. 

Now consider the higher energy D2 line, which is the ^P3/2 ^1/2 transition. In the upper level 
we have J = 3/2, and hence F = I ® J = 3/2 © 3/2 = 3, 2, 1, or 0. There are therefore four hyperfine 
states for the ^P3/2 level, as shown in Fig. 6.7(c). The hyperfine splittings of the ^P3/2 level are again 
much smaller than that of the ^Si/2 level, on account of the low probability density of p-electrons near 
the nucleus. Six transitions are allowed by the selection rules, with the = 3 — > 1 and = — >■ 2 
transitions being forbidden by the |Ai^| < 1 selection rule. We thus have six hyperfine lines, which split 
into two triplets at relative frequencies of (0, 34, 59) MHz and (1756, 1772, 1806) MHz. 
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Chapter 7 



External fields: the Zeeman and 
Stark eff'ects 



In the previous sets of notes we have been considering the effects of the internal electric and magnetic 
fields within atoms. We now wish to consider the effects of external fields. We shall start by looking at 
magnetic fields and then move on to consider electric fields. 

7.1 Magnetic fields 

The first person to study the effects of magnetic fields on the optical spectra of atoms was Zeeman in 
1896. He observed that the transition lines split when the field is applied. Further work showed that the 
interaction between the atoms and the field can be classified into two regimes: 

• Weak fields: the Zeeman effect, either normal or anomalous; 

• Strong fields: the Paschen-Back effect. 

The "normal" Zeeman effect is so-called because it agrees with the classical theory developed by Lorentz. 
The "anomalous" Zeeman effect is caused by electron spin, and is therefore a completely quantum result. 
The criterion for deciding whether a particular field is "weak" or "strong" will be discussed in Section 7.1.3. 
In practice, we usually work in the weak-field (i.e. Zeeman) limit. 

7.1.1 The normal Zeeman effect 

The normal Zeeman effect is observed in atoms with no spin. The total spin of an A/'-electron atom is 
given by: 

AT 
i=l 

Filled shells have no net spin, and so we only need to consider the valence electrons here. Since all the 
individual electrons have spin 1/2, it will not be possible to obtain S = from atoms with an odd number 
of valence electrons. However, if there is an even number of valence electrons, we can obtain S = states. 
For example, if we have two valence electrons, then the total spin quantum number 5 = l/2®l/2 can be 
either or 1. In fact, the ground states of divalent atoms from group II of the periodic table (electronic 
configuration ns^) always have S = because the two electrons align with their spins antiparallel. 
The magnetic moment of an atom with no spin will originate entirely from its orbital motion: 

M = -^i, (7.2) 

where /ie/^ — e/2mc is the gyromagnetic ratio. (See eqn 6.4.) The interaction energy between a 
magnetic dipole /j, and a uniform magnetic field B is given by: 

AE = -n-B. (7.3) 
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Figure 7.1: The normal Zeeman effect, (a) Splitting of the degenerate mi states of an atomic level with 
Z = 2 by a magnetic field, (b) Definition of longitudinal (Faraday) and transverse (Voigt) observations. 
The direction of the field defines the z axis. 
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Figure 7.2: The normal Zeeman effect for a p — > d transition, (a) The field splits the degenerate m; 
levels equally. Optical transitions can occur if Am; — 0,±1. (Only the transitions originating from the 
m; = level of the I = 1 state are identified here for the sake of clarity.) (b) The spectral line splits into a 
triplet when observed transversely to the field. The Am; = transition is unshifted, but the Am; = ±1 
transitions occur at {Iivq ^ (IbBz)- 



We set up the axes of our spherically-symmetric atom so that the z axis coincides with the direction of 
the field. In this case we have: 

B = 

and the interaction energy of the atom is therefore: 

AE = ~fi,B, = fiBB.mi , (7.4) 

where mi is the orbital magnetic quantum number. Equation 7.4 shows us that the application of an 
external S-field splits the degenerate m; states evenly. This is why mi is called the magnetic quantum 
number. The splitting of the m; states of an ^ = 2 electron is illustrated in Fig. 7.1(a). 

The effect of the magnetic field on the spectral lines can be worked out from the splitting of the levels. 
Consider the transitions between two Zeeman-split atomic levels as shown in Fig. 7.2. The selection rules 
listed in Table 3.1 of Chapter 3 indicate that we can have transitions with Ami = or ±1. The gives 
rise to three transitions whose frequencies are given by: 

hv — hi>Q + iiBBz Ami = ~l , 

hv — Hvq Ami = , (7.5) 

hv — Hvq — ^bBz Ami = +1 ■ 
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Am; 


Energy 


Polarization 
Longitudinal Transverse 
observation observation 


+1 


huo - HbB 


a+ 


£ LB 







not observed 


S II B 


-1 


hvo + Mb-B 


a~ 


S LB 



Table 7.1: The normal Zeeman effect. The last two columns refer to the polarizations observed in 
longitudinal (Faraday) and transverse (Voigt) observation conditions. The direction of the circular (cr^) 
polarization in longitudinal observation is defined relative to B. In transverse observation, all lines are 
linearly polarized. 

This is the same result as that derived by classical theory. 

The polarization of the Zeeman lines is determined by the selection rules, and the conditions of obser- 
vation. If we are looking along the field (longitudinal observation), the photons must be propagating in 
the z direction. (Sec Fig. 7.1(b).) Light waves are transverse, and so only the x and y polarizations are 
possible. The z-polarized Am; = line is therefore absent, and we just observe the C7+ and cr~ circularly 
polarized Am; = ±1 transitions. When observing at right angles to the field (transverse observation), 
all three lines are present. The Am; = transition is linearly polarized parallel to the field, while the 
Am; = ±1 transitions are linearly polarized at right angles to the field. These results are summarized in 
Table 7.1.^ 

7.1.2 The anomalous Zeeman effect 

The anomalous Zeeman effect is observed in atoms with non-zero spin. This will include all atoms with 
an odd number of electrons. In the LS-coupling regime, the spin-orbit interaction couples the spin and 
orbital angular momenta together to give the resultant total angular momentum J according to: 

J = L + S. (7.6) 

The orbiting electrons in the atom are equivalent to a classical magnetic gyroscope. The torque applied 
by the field causes the atomic magnetic dipole to precess around B, an effect called Larmor precession. 
The external magnetic field therefore causes J to precess slowly about B. Meanwhile, L and S precess 
more rapidly about J due to the spin-orbit interaction. This situation is illustrated in Fig. 7.3(a). The 
speed of the precession about B is proportional to the field strength. If we turn up the field, the Larmor 
precession frequency will eventually be faster than the spin-orbit precession of L and S around J. This 
is the point where the behaviour ceases to be Zeeman-like, and we are in the strong field regime of the 
Paschen-Back effect. 

The interaction energy of the atom is equal to the sum of the interactions of the spin and orbital 
magnetic moments with the field: 

AE = -M.B, = -(m|p- + /i-'''*-')B, = {g,S, + L,) ^B, , (7.7) 

where gg = 2, and the symbol (• • • ) implies, as usual, that we take expectation values. The normal 
Zeeman effect is obtained by setting 8^=0 and = miH in this formula. In the case of the precessing 
atomic magnet shown in Fig. 7.3(a), neither Sz nor arc constant. Only = Mjh is well-defined. 
We must therefore first project L and S onto J, and then re-project this component onto the z axis. 
The effective dipole moment of the atom is therefore given by: 

/X = - ^ I L I cos ^1 ^ + 2| S I cos 02 ^ ^ ^ , (7.8) 

^In solid-state physics, the longitudinal and transverse observation conditions are frequently called the Faraday and 
Voigt geometries, respectively. 
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Figure 7.3: (a) Slow precession of J around B in the anomalous Zeeman effect. The spin-orbit interaction 
causes L and S to precess much more rapidly around J. (b) Definition of the projection angles 9i and 
02 used in the calculation of the Lande g factor. 



where the factor of 2 in the second term comes from the fact that gs = 2. The angles 9i and 02 that 
appear here are defined in Fig. 7.3(b), and can be calculated from the scalar products of the respective 
vectors: 

L J = |i| |J|cos6'i , 

S-J = l^l |J|cos6l2, (7.9) 

which implies that: 

Now equation 7.6 implies that S = J L, and hence that: 

S ■ S={J- L)-{J L) = J ■ J + L ■ L 2L ■ J . 



We therefore find that: 
so that: 



L • J = {J • J + L • L - S • S)/2, 
L-J\ _ [J{J +l) + L{L + \)- S{S + l)]h^l2 



Similarly: 
and so: 



|JP / J{J + l)K^ 

[J(J+1) + L(L+1)- 5(5 + 1)] 
2J(J+1) 

S-J = {J-J + S-S-L-L)/2, 

S-J\ _ [J{J + 1) + 5(5 + 1) - L{L + l)]h^/2 

J{J + l)h^ 
[,/(J + l) + 5(5+l)-i(L + l)] 



2J(J + 1) 



We therefore conclude that: 



(7.11) 



(7.12) 



f [JiJ + l)+L{L + l)-S{S+l)] , JJ(J+1) + 5(5+1)-£(£ + 1)] Vb , .„^„. 
[ 2J(J + 1) +^ 2J(J+1) Jh-^- ^^-^"^^ 

This can be written in the form: 

t^ = -gj^J. (7.14) 

where gj is the Lande g-factor given by: 

^^^,^JiJ + ^) + Si^S+l)-LiL + l) 

2J(J + 1) ^ ' 

This implies that 

M2 = -5j/^bM/, (7.16) 
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Level 


J 


L 


5* 


9j 


'P3/2 


3/2 


1 


1/2 


4/3 




1/2 


1 


1/2 


2/3 




1/2 





1/2 


2 



Table 7.2: Lande g-factors evaluated from eqn 7.15 for the levels involved in the sodium D lines. 



and hence that the interaction energy with the field is: 

AE = -n,B, = qjUbB^Mj . (7.17) 

This is the final result for the energy shift of an atomic state in the anomafous Zccman effect. Note that 
we just obtain gj = 1 if S = 0, as wc would expect for an atom with only orbital angular momentum. 
Similarly, if L = so that the atom only has spin angular momentum, we find gj — 2. Classical theories 
always predict gj = 1. The departure of gj from unity is caused by the spin part of the magnetic moment, 
and is a purely quantum effect. 

The spectra can be understood by applying the following selection rules on J and Mj: 

AJ = 0,±1; 

AMj = 0,±1. 

These rules have to be applied in addition to the Al = ±1 and AS = rules. (See discussion in § ??.)^ 

AJ = transitions are forbidden when J for both states, and AMj = transitions are forbidden in 
a A J = transition. The transition energy shift is then given by: 

hAf = {hv — hvo) , 

where hv^) is the transition energy at = and the superscripts refer to the upper and lower states 
respectively. 

The polarizations of the transitions follow the same patterns as for the normal Zeeman effect: 

• With longitudinal observation the AMj = transitions are absent and the AMj = ±1 transitions 
are circularly polarized. 

• With transverse observation the AMj = transitions are linearly polarized along the z axis (i.e. 
parallel to B) and the AMj = ±1 transitions are linearly polarized in the x-y plane (i.e. perpen- 
dicular to B). 



Example: The sodium D lines 

The sodium D lines correspond to the 3p — >■ 3s transition. At B = 0, the spin-orbit interaction splits the 
upper 3p term into the ^^3/2 and levels separated by 17cm~^. The lower level has no 

spin-orbit interaction. The Lande g-factors of the levels worked out from eqn 7.15 are given in Table 7.2. 

The splitting of the lines in the field is shown schematically in Fig. 7.4. The ^^3/2 level splits into 
four Mj states, while the two J = 1/2 levels each split into two states. The splittings are different for 
each level because of the different Lande factors. On applying the AMj = 0,±1 selection rule, we find 
four allowed transitions for the Di line and six for the D2. These transitions arc listed in Table 7.3. 

The results tabulated in Table 7.3 can be compared to those predicted by the normal Zeeman effect. In 
the normal Zeeman effect we observe three lines with an energy spacing equal to /j-bB. In the anomalous 
effect, there are more than three lines, and the spacing is different to the classical value: in fact, the lines 
are not evenly spaced. Furthermore, none of the lines occur at the same frequency as the unperturbed 
line at B = 0. 



^There are no selection rules on and Ms here because Lz and Sz are not constants of the motion when L and S are 
coupled by the spin-orbit interaction. 
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Figure 7.4: Splitting of the sodium D-lines by a weak magnetic field. Note that the Zeeman splittings 
are smaller than the spin-orbit splitting, as must be the case in the "weak" field limit. 



7.1.3 The Paschen-Back effect 

The Paschen-Back effect is observed at very strong magnetic fields. The criterion for observing the 
Paschen-Back effect is that the interaction with the external magnetic field should be much stronger than 
the spin-orbit interaction: 

MbS, »A£;,o. (7.19) 

If we satisfy this criterion, then the precession speed around the external field will be much faster than 
the spin-orbit precession. This means that the interaction with the external field is now the largest 
perturbation, and so it should be treated first, before the perturbation of the spin-orbit interaction. 

Another way to think of the strong-field limit is that it occurs when the external field is much stronger 
than the internal field of the atom arising from the orbital motion. We saw in Section 6.3 that the internal 
fields in most atoms are large. For example, the Bohr model predicts an internal field of 12 T for the 
n = 1 shell of hydrogen. (See eqn 6.16.) This is a very strong field, that can only be obtained in the 
laboratory by using powerful superconducting magnets. This internal field strength is typical of many 
atoms, and so it will frequently be the case the field required to observe the Paschen-Back effect is so 
large that we never go beyond the Zeeman regime in the laboratory.'^ For example, in sodium, the field 
strength equivalent to the spin-orbit interaction for the D-lines is given by: 



AE, 



'so 



17 cm 



9.27 X 10-24 



JT"^ 



= 36T, 



which is not achievable in normal laboratory conditions. On the other hand, since the spin-orbit interac- 
tion decreases with the atomic number Z, the splitting for the equivalent transition in lithium (i.e. the 
2p — > 2s transition) is only 0.3 cm"^. This means that we can reach the strong field regime for fields 
^ 0.6 T. This is readily achievable, and allows the Paschen-Back effect to be observed. 

In the Paschen-Back effect, the spin-orbit interaction is assumed to be negligibly small, and L and S 
are therefore no longer coupled together. Each precesses separately around B, as sketched in Fig. 7.5. 
The precession rates for L and S are different because of the different g- values. Hence the magnitude of 
the resultant J varies with time: the quantum number J is no longer a constant of the motion. 

The interaction energy is now calculated by adding the separate contributions of the spin and orbital 
energies: 



AE = -^i,B, = -(/i 



orbital 



spin' 



B, = (Ml + gsMs)fiBB, 



(7.20) 



■^There are extremely large magnetic fields present in the Sun due to the circulating plasma currents. This means that 
the Paschen-Back elfect can be observed for elements like sodium in solar spectra. 
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^3 
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2 


+1 




-1 



Table 7.3: Anomalous Zeeman effect for the sodium D lines. The transition energy shifts are worked out 
from eqn 7.18 and are quoted in units of hbBz- 



B 




Figure 7.5: Precession of L and S around B in the Paschen-Back effect. 



The shift of the spectral lines is given by: 

A{hi^) = {AMl + gsAMs)tiBB, . (7.21) 

We have noted before that optical transitions do not affect the spin, and so we must have AMs — 0. The 
frequency shift is thus given by: 

A(H=MbS,AMl, (7.22) 
where AMl = or ±1. In other words, we revert to the normal Zeeman effect. 



Putting it all together 

The change of the spectra as we increase B from zero is illustrated for the p -> s transitions of an alkali 
atom in Fig. 7.6. At i? = the lines are split by the spin-orbit interaction. At weak fields we observe 
the anomalous Zeeman effect, while at strong fields we change to the Paschen-back effect. 



7.1.4 Magnetic field eff'ects for hyperfine levels 

Everything we have said so far has ignored the hyperfine structure of the atom. The whole process can 
be repeated to calculate the Zeeman and Paschen-Back energy shifts for the hyperfine levels. In this 
case, the energy splittings at _B = are much smaller, due to the much smaller gyromagnetic ratio of the 
nucleus compared to the electron. (See Section 6.7.2.) This implies that the change from the weak-field 
to the strong-field limit occurs at much smaller field strengths than for the states split by fine-structure 
interactions. We shall not consider the hyperfine states further in this course. 
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Figure 7.6: Schematic progression of the optical spectra for the p — >■ s transitions of an alkah atom with 
increasing field. 



7.2 The concept of 'good' quantum numbers 

It is customary to refer to quantum numbers that relate to constants of the motion as 'good' quantum 
numbers. In this discussion of the effects of magnetic fields, we have used six different quantum numbers 
to describe the angular momentum state of the atom: J, Mj, L, Ml, S, Ms- However, we cannot know 
all of these at the same time. In fact, we can only know four: (i, S, J, Mj) in the weak-field limit, or 
(L, 5, Ml, Ms) in the strong-field limit. In the weak-field limit, L^ and are not constant which implies 
that J and Mj are 'good' quantum numbers but Ml and Ml are not. Similarly, in the strong-field limit, 
the coupling between L and S is broken and so J and Jz are not constants of the motion: Ml and Ms 
are good quantum numbers, but J and Mj are not. 

A similar type of argument applies to the two angular momentum coupling schemes discussed in 
Section 5.6, namely LS-coupling and jj-coupling. As an example, consider the total angular momentum 
state of a two electron atom. In the LS-coupling scheme, we specify {L, S, J, Mj), whereas in the jj- 
coupling scheme we have (ji, j2, J, Mj). In both cases, we have four 'good' quantum numbers, which tell 
us the precisely measurable quantities. The other quantum numbers are unknown because the physical 
quantities they represent are not constant. In LS-coupling we cannot know the j values of the individual 
electrons because the residual electrostatic potential overpowers the spin-orbit effect, whereas in the jj- 
coupling scheme we cannot know L and S. Note, however, that J and Mj are good quantum numbers 
in both coupling limits. This means that we can always describe the Zeeman energy of the atom by 
eqn 7.17, although in the case of jj-coupling, the formula for the gj factor given in cqn 7.15 will not be 
valid because L and S are not good quantum numbers. 



7.3 Electric fields 



The shifting and splitting of spectral lines in an electric field is called the Stark efTect. This effect 
was first observed in 1913. In most atoms we observe the quadratic Stark effect and we therefore 
consider this effect first. We then move on to consider the linear Stark effect, which is observed for 
the excited states of hydrogen, and in other atoms at very strong fields. The Stark shift of an atom is 
harder to observe than the Zeeman shift, which explains why magnetic effects are more widely studied in 
atomic physics. However, large Stark effects are readily observable in solid state physics, and we therefore 
conclude by briefly considering the quantum-confined Stark effect. 



7.3.1 The quadratic Stark effect 

Most atoms show a small red shift (i.e. a shift to lower energy) which is proportional to the square of the 
electric field. This phenomenon is therefore called the quadratic Stark effect. The energy of an atom in 
an electric field £ is given by 

E = -p-£, (7.23) 
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Figure 7.7: Effect of an electric field £ on the electron cloud of an atom, (a) When £ = 0, the negatively- 
charged electron cloud is arranged symmetrically about the nucleus, and there is no electric dipole. (b) 
When the electric field is applied, the electron cloud is displaced, and a net dipole parallel to the field is 
induced. 



where p is the electric dipole of the atom. We can understand the quadratic Stark effect intuitively with 
reference to Fig. 7.7. The negatively-charged electron clouds of an atom are spherically symmetric about 
the positively-charged nucleus in the absence of applied fields. A charged sphere acts like a point charge 
at its centre, and it is thus apparent that atoms do not normally possess a dipole moment, as shown in 
Fig. 7.7(a). When a field is applied, the electron cloud and the nucleus experience opposite forces, which 
results in a net displacement of the electron cloud with respect to the nucleus, as shown in Fig. 7.7(b). 
This creates a dipole p which is parallel to £ and whose magnitude is proportional to \£\. This can be 
expressed mathematically by writing: 

p = a£, (7.24) 

where a is the polarizability of the atom. The energy shift of the atom is found by calculating the 
energy change on increasing the field strength from zero: 

£ £ 

AE = - [ p-d£'^-[ a£' d£' ^ -\a£^ , (7.25) 
Jo Jo 2 

which predicts a quadratic red shift, as required. The magnitude of the red shift is generally rather small. 
This is because the electron clouds are tightly bound to the nucleus, and it therefore requires very strong 
electric fields to induce a significant dipole. 

We can understand the quadratic Stark effect in more detail by applying perturbation theory.** The 
perturbation caused by the field is of the form: 

H' = -Y,i^en)-£, 

(7.26) 



where the field is assumed to point in the +z direction. In principle, the sum is over all the electrons, 
but in practice, we need only consider the valence electrons, because the electrons in closed shells are 
very strongly bound to the nucleus and are therefore very hard to perturb. In writing eqn 7.26, we take, 
as always, rj to be the relative displacement of the electron with respect to the nucleus. 

For simplicity, we shall just consider the case of alkali atoms which possess only one valence electron. 
In this case, the perturbation to the valence electron caused by the field reduces to: 

H' = e£z. (7.27) 

The first-order energy shift is given by: 

AE = (V'liJ'lV') = e£(V'k|V') , (7.28) 

where 

i;*z^d^r. (7.29) 

all space 



^Many of you will not have done perturbation theory yet, as it is normally first encountered in detail in course PHY309, 
which is taken in the second semester. You will therefore have to take the results presented here on trust. 
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Now unperturbed atomic states have definite parities. (See discussion in Section 3.4.) The product 
tp*tp = 1 jg therefore an even function, while z is an odd function. It is therefore apparent that 

{ip\z\ip) = /// (even function) x (odd function) d^r = . 

J J J a\\ space 

The first-order energy shift is therefore zero, which explain why the energy shift is quadratic in the field, 
rather than linear. 

The quadratic energy shift can be calculated by second-order perturbation theory. In general, the 
energy shift of the «th state predicted by second-order perturbation theory is given by: 

^ Ei - Ej 

where the summation runs over all the other states of the system, and Ei and Ej are the unperturbed 
energies of the states. The condition of validity is that the magnitude of the perturbation, namely 
\{ipi\H'\ipj)\, should be small compared to the unperturbed energy splittings. For the Stark shift of the 
valence electron of an alkali atom, this becomes: 

Ai^. = e^£^V 'y^^^r (7.31) 

We see immediately that the shift is expected to quadratic in the field, which is indeed the case for most 
atoms. 

As a specific example, we consider sodium, which has a single valence electron in the 3s shell. We 
first consider the ground state 3s^Si/2 term. The summation in eqn 7.31 runs over all the excited states 
of sodium, namely the 3p, 3d, 4s, 4p, . . . states. Now in order that the matrix element (V'il-^lV'j) should 
be non-zero, it is apparent that the states i and j must opposite parities. In this case, we would have: 

{ipi\z\ipj) = / // (even/odd parity) x (odd parity) x (odd/even parity) d^r 7^ , 

J J all space 

since the integrand is an even function. On the other hand, if the states have the same parities, we have: 
{ipi\z\ipj) = / / / (even/odd parity) x (odd parity) x (even/odd parity) d^r = , 

J J 7 all space 

since the integrand is an odd function. Since the parity varies as (—1)', the s and d states do not 
contribute to the Stark shift of the 3s state, and the summation in eqn 7.31 is only over the p and f 
excited states. Owing to the energy difference factor in the denominator, the largest perturbation to the 
3s state will arise from the first excited state, namely the 3p state. Since this lies above the 3s state, the 
energy difference in the denominator is negative, and the energy shift is therefore negative. Indeed, it is 
apparent that the quadratic Stark shift of the ground state of an atom will always be negative, since the 
denominator will be negative for all the available states of the system. This implies that the Stark effect 
will always correspond to a red shift for the ground state level. 

There is no easy way to calculate the size of the energy shift, but we can give a rough order of 
magnitude estimate. If we neglect the contributions of the even parity excited states above the 3p state, 
the energy shift will be given by: 



The expectation value of z over the atom must be smaller than a, where a is the atomic radius of sodium, 
namely 0.18 nm. Hence with E^p — E^s = 2.1 eV, we then have: 

which implies from eqn 7.25 that a^g ^ 3.2 x 10"^° eVm^ V"^. This predicts a shift of < — 1 x 10^^ eV 
(— O.OScm^^) in a field of 2.5 x 10^ V/m, which compares reasonably well with the experimental value of 
-0.6 X 10-5 eV (-0.05 cm-i). 
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Figure 7.8: (a) Shift of the ^Si/2, and ^P3/2 terms of an alkah atom in an electric field. Note 

that the red shifts of the upper levels are larger than that of the lower level, (b) Red shift of the Di 
(2Pi/2 and the D2 (^P3/2 ^ ^Si/2) lines in the field. 



The order of magnitude calculation given above can also provide a useful estimation of the field 
strength at which the second-order perturbation approximation breaks down. As mentioned above, 
this will occur when the magnitude of the perturbation become comparable to the unperturbed energy 
splitting, that is when: 

e£\{^P3s\z\^3p)\ ^ (Esp-Ess). 

On setting | (?/'3s|z|?/'3p) | = a as before, we find £ ^ 10^"V/m, which is an extremely large field. The 
second-order perturbation approach will therefore be a good approximation in most practical situations. 

Now consider the Stark shift of the 3p state. The 3p state has odd parity, and so the non-zero 
contributions in eqn 7.31 will now arise from the even parity ns and nd states: 

Esp — E^p — i?3d E^p — E^s J 

The first term gives a positive shift, while all subsequent terms are negative. Therefore, it is not imme- 
diately obvious that the Stark shift of excited states like the 3p state will be negative. However, since 
the energy difference of the excited states tends to get smaller as we go up the ladder of levels, it will 
generally be the case that the negative terms dominate, and we have a red shift as for the ground state. 
Moreover, the red shift is generally expected to be larger than that of the ground state for the same 
reason (i.e. the smaller denominator). In the case of the 3p state of sodium, the largest contribution 
comes from the 3d state which lies 1.51 eV above the 3p state, even though the 4s state is closer (relative 
energy -1-1.09 eV). This is because of the smaller value of the matrix element for the s states. 

Explicit evaluation of the matrix elements indicates that the Stark shift at a given field strength 
depends on Afj. This means that electric fields do not completely break the degeneracy of the Mj 
sub-levels of a particular |J) term. This contrasts with the Zeeman effect, where the energy shift is 
proportional to Mj, and the degeneracy if fully lifted. The Stark shift of the sodium D lines is shown 
schematically in Fig. 7.8. All states are shifted to lower energy, with those of the same Mj values being 
shifted equally for a given level, as indicated in Fig. 7.8(a). The shifts of the upper 3p levels are larger 
than that of the lower 3s ^Si/2 term, and both spectral lines therefore show a net shift to lower energy, 
as indicated in Fig. 7.8(b). Owing to the degeneracy of the sub-levels with the same \Mj\, the Di 
(^Pi/2 ^Si/2) fine does not split, while the D2 (^P3/2 — >■ ^Si/2) line splits into a doublet. 

An interesting consequence of the perturbation caused by the electric field is that the unperturbed 
atomic states get mixed with other states of the opposite parity. For example, the 3s state has even 
parity at f = 0, but acquires a small admixture of the odd parity 3p state as the field is increased. 
This means that parity forbidden transitions (eg s— >-s, p— ^-p, d— )-s, etc.) become weakly allowed as the 
field is increased. Since we are dealing with a second-order perturbation, the intensity of these forbidden 
transitions increases in proportion to f^. 
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7.3.2 The linear Stark effect 

Stark's original experiment of 1913 was performed on the Balmer lines of hydrogen.^ In contrast to 
what has been discussed in the previous subsection, the shift was quite large, and varied linearly with 
the field. The reason for this is that the I states of hydrogen are degenerate. This means that we have 
states of opposite parities with the same energy, so that the second-order energy shift given by eqn 7.31 
diverges. We therefore have to take a new approach to calculate the Stark shift by employing degenerate 
perturbation theory. 

Consider first the Is ground state of hydrogen. This level is unique, and hence the second-order 
perturbation approach is valid. A small quadratic red-shift therefore occurs, as discussed in the previous 
sub-section. 

Now consider the n = 2 shell, which has four levels, namely the m = level from the 2s term, and 
the m = —1, 0, and -|-1 levels of the 2p term. In the absence of an applied field, these four levels are 
degenerate. If the atom is in the n = 2 shell, it is equally likely to be in any of the four degenerate levels. 
We must therefore write its wave function as: 

4 

V'n=2=^CiVi, (7.32) 
i=l 

where the subscript i identifies the quantum numbers {n,l,m}, that is: 

■01 = '02,0,0 ; V'2 = V'2,1,-1 ; V'3 = ^"2,1,0 ; ^4 = ■^2,1,+1 ■ 
The first-order energy shift from eqn 7.28 becomes: 

AE = e£Y^ CiCj{iPi\z\^j) . (7.33) 

Unlike the case of the ground state, we can see from parity arguments that some of the matrix elements 
are non-zero. For example, tl>i has even parity, but tp^ has odd parity. We therefore have: 



J J J all space 

(even parity) x (odd parity) x (odd parity) d^r 

all space 



^ 0. 



This implies that we can observe a imear shift of the levels with the field. It turns out that (V'll-^IV's) is 
the only non-zero matrix element. This is because the perturbation H' = e£z commutes with L^, and so 
the only non-zero matrix elements are those between states with the same m value but opposite parity, 
that is, between the two m = levels derived from the 2s and 2p states. 

It can easily be evaluated from the hydrogenic wave functions of the n = 2 levels given in Tables 2.1 
and 2.2 that: 

(V'll^lV's) = -3ao, 

where ao is the Bohr radius of hydrogen. We then find by degenerate perturbation theory that the 
field splits the n = 2 shell into a triplet, with energies of — Seaof, 0, and -\-^eao£ with respect to the 
unperturbed level. Note that this shift is linear in the field and has a much larger magnitude than that 
calculated for the quadratic Stark effect. For example, at f = 2.5 x 10^ V/m, we find shifts of ±4 x 10~^ eV 
(32cm~^), which are more than two orders of magnitude larger than the shifts of the levels in sodium 
at the same field strength. This, of course, explains why the linear Stark effect in hydrogen was the first 
electric-field induced phenomenon to be discovered. 

It was mentioned in Section 7.3.1 that the second-order perturbation analysis is expected to break 
down at large field strengths when the field-induced perturbation becomes comparable to the splittings 
of the unperturbed levels. We made an estimate of this for the 3s level of sodium and concluded that 
extremely large fields were required for the strong-field limit to be reached. However, the fields required 
for the breakdown of the second-ordcir approach for the excited states can be significantly smaller, because 
some atoms can have different parity excited states which are relatively close to each other. We would 
then expect the behaviour to change as the field is increased. At low fields we would observe the quadratic 



^The Balmer series of hydrogen corresponds to those lines that terminate on the n = 2 level. These lines occur in the 
visible spectral region. 
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Energy quantum well 




(a) (b) (C) 

Figure 7.9: The quantum confined Stark effect, (a) Excitons are created by optical transitions from the 
valence to the conduction band of a semiconductor, (b) A quantum well is formed when a thin layer of a 
semiconductor with a band gap Eg is sandwiched between layers of another semiconductor with a larger 
band gap E'^. (c) Electric fields are applied to an exciton in a quantum well by embedding the quantum 
well within a P-N junction and applying reverse bias. 



Stark effect, but when the field is sufficiently large that the perturbation is comparable to the energy 
splitting, we would effectively have degenerate levels with different parities. This would then result in a 
linear shift determined by degenerate perturbation theory. This change from the quadratic Stark effect at 
low fields to the linear Stark effect at high fields was first studied for the (Is, Al) excited state configuration 
of helium by Foster in 1927. 

7.3.3 The quantum-confined Stark eff'ect 

An optical transition between the valence and conduction bands of a semiconductor leaves a positively- 
charged hole in the valence band, and a negatively-charged electron in the conduction band, as shown in 
Fig. 7.9(a). The electron and hole can bind together to form a hydrogen-like atom called an exciton. 
The binding energy of the exciton is rather small, due to the high relative dielectric constant Cr of the 
semiconductor, and also because of the low reduced effective mass of the exciton. Typical values might 
be ej. ^ 10 and m ~ O.lwe, which implies from eqn 1.10 that the Is binding energy would be ^ 0.01 cV.^ 

From the discussion given in Section 7.3.1, we would expect the Is exciton state to show a quadratic 
Stark shift as an electric field is applied. However, in bulk semiconductors the excitons are very unstable 
to applied electric fields due to their low binding energy, which implies that the electrostatic force between 
the electron and hole is relatively small. The electrons and holes are pushed in opposite directions, and 
the exciton then easily gets ripped apart by the field. This effect is called field ionization. It can also 
be observed in atomic physics, but only at extremely high field strengths. 

The situation in a quantum- confined structure such as a semiconductor quantum well or quantum 
dot is rather different. Consider the case of the quantum well shown in Fig. 7.9(b). The quantum well is 
formed by sandwiching a thin semiconductor with a band gap of Eg between layers of another semicon- 
ductor with a larger band gap Eg. This then gives rise to spatial discontinuities in the conduction and 
valence band energies as shown in the figure. The excitons that are formed by optical transitions across 
the smaller band gap are then trapped in the finite potential well created by the band discontinuities. 

A strong electric field can be applied to the quantum well by embedding it within a P-N junction, and 
then applying reverse bias, as shown in Fig. 7.9(c). P-N junctions conduct when forward bias is applied, 
but not under reverse bias. In the latter case, the applied voltage is dropped across the narrow junction 
region, thereby generating an electric field that is controlled by the reverse bias. The excitons that are 
created by optical transitions are now stable to the field, because the barriers of the quantum well prevent 
them from being ripped apart. The electrons are pushed to one side of the quantum well, and the holes 
to the other, which creates a dipole of magnitude ~ ed, where d is the width of the quantum well. With 
d ^ 10 nm, much larger dipoles can be created than in atomic physics, resulting in correspondingly larger 



Note that the factor of in the denominator of eqn 1.10 has to be replaced by (erco)^ in ^ dielectric medium. 
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Stark shifts. This effect is caUed the quantum-conflned Stark effect, and is widely used for making 
electro-optical modulators. The quantum-confined Stark effect will be studied in more detail in course 
PHY475. 
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Chapter 8 

Helium and exchange symmetry 



In this chapter we will look at atoms with two valence electrons. This includes helium, and the group 
II elements: beryllium, magnesium, calcium, etc. As we will see, this leads to the idea of the exchange 
energy. We shall use helium as the main example, as it is a true two electron system and illustrates the 
physical points most clearly. 

8.1 Exchange symmetry 

Consider a multi-electron atom with N electrons, as illustrated in figure 8.1(a). The wave function of the 
atom will be a function of the co-ordinates of the individual electrons: 

= *(ri,r2, • • • ,rK,rL,- ■■vn) 

However, the electrons are indistinguishable particles. It is not physically possible to stick labels on 
the individual electrons and then keep tabs on them as the move around their orbits. This means that 
the many-electron wave function must have exchange symmetry: 

|*(ri,r2, • • • ,rK,rL,- ■■rN)f = |*(ri,r2, • • • ,rL,rK,- • -r-Ar)!^ . (8.1) 

This says that nothing happens if we switch the labels of any pair of electrons. Equation 8.1 will be 
satisfied if 

*(ri,r2, • • • ,rK,rLr • -rAr) = ± ^'(ri,r2, • • • ,rL,rK,- --riy) . (8.2) 

The -I- sign in equation 8.2 applies if the particles are bosons. These are said to be symmetric with 
respect to particle exchange. The — sign applies to fermions, which are anti-symmetric with respect 
to particle exchange. 

Electrons have spin 1/2 and are therefore fermions. Hence the wave function of a multi-electron atom 
must be anti- symmetric with respect to particle exchange. This is a very fundamental property, and is 
the physical basis of the Pauli exclusion principle, as we shall see below. 

The discussion of exchange symmetry gets quite complicated when there are lots of electrons, and so 
we shall just concentrate on helium here. 

8.2 Helium wave functions 

Figure 8.1(b) shows a schematic diagram of a helium atom. It consists of one nucleus with Z = 2 and 
two electrons. The position co-ordinates of the electrons are written ri and r2 respectively. 




Figure 8.1: (a) A multi-electron atom with TV electrons, (b) The helium atom. 
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V'spatial 


V'spin 


symmetric 
anti-symmetric 


anti-symmetric {S = 0) 
symmetric {S = 1 



Table 8.1: Allowed combinations of the exchange symmetries of the spatial and spin wave functions of 
fermionic particles. 

The quantum state in the helium atom will be specified both by the spatial co-ordinates and by the 
spin of the two electrons. The two-electron wave function is therefore written as a product of a spatial 
wave function and a spin wave function: 

* = V'spatial(T'l,T'2)'0spin • (8.3) 

As we have seen above, the fact that electrons are indistinguishable fermions requires that the two-electron 
wave function 'i' must be anti-symmetric with respect to exchange of electrons 1 and 2. Table 8.1 lists the 
two possible combinations of wave function symmetries that can produce an antisymmetric total wave 
function. 

Let us first consider the spatial wave function. The state of the atom will be specified by the con- 
figuration of the two electrons. In the ground state both electrons are in the Is shell, and so we have 
a configuration of Is^. In the excited states, one or both of the electrons will be in a higher shell. The 
configuration is thus given by the n,l values of the two electrons, and we write the configuration as 
(ni?i, n2^2). This means that the spatial part of the helium wave function must contain terms of the 
type UA{ri) UB{r2), where Uni(r) is the wave function for an electron with quantum numbers n and I, 
and the subscripts A and B stand for the quantum numbers n, I of the two electrons. 

The discussion above does not take account of the fact that the electrons are indistinguishable: we 
cannot distinguish between the state with electron 1 in state A and electron 2 in state B, and vice versa. 
Ub(t"i) UA{r2) is therefore an equally valid wave function for the particular electronic configuration. The 
wave function for the configuration A, B must therefore take the form: 

■4'AB{ri,r2) = (uA{ri) UB{r2) ± 'Ws(r-i) UA(r2)) • (8.4) 

The I/V2 factor ensures that 4'AB{i'i,r2) is correctly normalized. It is easy to verify that the wave 
function with the -|- sign is symmetric with respect to particle exchange, while the wave function with 
the — sign is antisymmetric. From Table 8.1, we see that these must be paired off with anti-symmetric 
and symmetric spin wave functions respectively. 

Let us now consider the spin wave function. We have two spin 1/2 electrons, and so the total spin 
quantum number S is given by S' = 1/2 ® 1/2 = 1 or 0. S = states are called singlets because they 
only have one possible Mg value, namely 0, while S = 1 states are called triplets because they have 
three possible Mg values, namely -|-1, 0, and —1. 

There are four possible ways of combining the spins of the two electrons so that the total wave function 
has exchange symmetry. These are listed in Table 8.2. The component of S along the z-axis is obtained 
by adding together the Sz values of the individual electrons. This gives the Sz value of the whole helium 
atom, and hence the spin quantum number Ms- 

Inspection of Table 8.2 shows us that we have three symmetric spin states with M5 values of -|-1, 
and —1 and one antisymmetric spin wave function with Mg = 0. The Mg = +1 and Mg = — 1 wave 
functions are derived unambiguously from the triplet state. On the other hand, the two Adg = wave 
functions could come from either the singlet or triplet states. However, the spin states must have well- 
defined exchange symmetries, and the Mg = ±1 wave functions are clearly symmetric. This implies that 
the symmetric Mg = wave function comes from the triplet state, and hence that the anti-symmetric 
wave function corresponds to the singlet state. We thus conclude that triplet states have symmetric 
spin wave functions, while singlets have anti-symmetric spin wave functions. We then see from Table 8.1 
that singlets and triplets must be paired off with symmetric and antisymmetric spatial wave functions 
respectively. 

The final conclusions about the wave functions are summarized in Table 8.3. The essential point is 
that the singlets and triplets have different spatial wave functions. This has a strong effect on the energy 
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Spin wave function 


symmetry 


Ms 


1 i \ z 


+ 


+1 


^(tl i2+U t2) 


+ 





:^(ti 4-2 - 4-1 12) 







il 4-2 


+ 


-1 



Tabic 8.2: Spin wave functions for a two-clcctron system. The arrows indicate whether the spin of the 
individual electrons is up or down (ie +1 or — |)- The + sign in the symmetry column applies if the 
wave function is symmetric with respect to particle exchange, while the — sign indicates that the wave 
function is anti-symmetric. The Sz value is indicated by the quantum number for Ms, which is obtained 
by adding the nis values of the two electrons together. 



S Ms 


V'spin 


V'spatial 





:^(ti 42 — 4l t2) 


^ (uA{ri) UB{r2) + usiri) UA{r2)^ 


+1 

1 

-1 


tl t2 
^(tl 42+4l t2) 


(uA{ri) UB{r2) - UB{ri) UAir^)^ 



Table 8.3: Spin and spatial wave functions for a two-electron atom with electronic configuration designated 
by the labels A and B. 

of the atom as we shall see below. This is a surprising result when you consider that the spin and spatial 
co-ordinates are basically independent of each other. 



8.3 The Pauli exclusion principle 

Let us suppose that we try to put the two electrons in the same atomic shell. The ground state of helium 
is an example of such a configuration, with both electrons in the Is shell. The spatial wave functions will 
be given by eqn 8.4 with A = B. The antisymmetric combination with the — sign in the middle is zero in 
this case. From Table 8.3 we see that this implies that there are no triplet S = 1 states if both electrons 
are in the same shell. 

The fact that the triplet state does not exist for the helium ground state is a demonstration of the 
rule that L + S must be even if the electrons are in the same shell. In the case of the Is^ configuration, 
we have L = 0, and therefore S' = 1 is not allowed. This rule was introduced without any justification in 
Section 5.9. The general justification of the rule is beyond the scope of this course, but the example of 
the helium ground state at least demonstrates that the rule is true for the simplest case. 

The absence of the triplet state for Is^ configuration is equivalent to the Pauli exclusion principle. 
We are trying to put two electrons in the same state as defined by the n,l,mi quantum numbers. This 
is only possible if the two electrons have different rrig values. In other words, their spins must be aligned 
anti-parallel. The S = 1 state contains terms with both spins pointing in the same direction, and is 
therefore not allowed. The analysis of the symmetry of the wave function discussed here thus shows 
us that the Pauli exclusion principle is a consequence of the fact that electrons are indistinguishable 
fermions. 
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8.3.1 Slater determinants 



We note in passing that the anti-symmetric wave function given in eqn. 8.4 can be written as a determi- 
nant: 



'0spatial 



1 
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UA{ri) 
usiri) 



UA{r2) 
UB{r2) 



This can be generahzed to give the correct anti-symmetric wave function when we have more than two 
electrons: 

Ma(2) ••• Ua{N) 



1 



(8.6) 



where {a, 13, 
and {1,2, • • ■ 



■ ■ ,1^} each represent a set of quantum numbers {n, I, mi,ms} for the individual electrons, 
, A^} arc the electron labels. Determinants of this type are called Slater determinants. 
Note that the determinant is zero if any two rows are equal, which tells us that each electron in the atom 
must have a unique set of quantum numbers, as required by the Pauli exclusion principle. 

We shall not make further use of Slater determinants in this course. They are mentioned here for 
completeness. 



8.4 The exchange energy 

The Hamiltonian for the helium atom before we consider fine-structure effects is given by: 

V 2m 47reori / \ 2m ^ ^near2 J 47reori2 

where ri2 = l^i — r2|. The first two terms enclosed in brackets account for the kinetic energy of the 
two electrons and their attraction towards the nucleus, which has a charge of +2e. The final term is the 
Coulomb repulsion between the two electrons. It is this Coulomb repulsion which makes the equations 
difficult to deal with. 

In § 4.1 and following we described how to deal with a many-electron Hamiltonian by splitting it into 

a central field and a residual electrostatic interaction. In the case of helium, we just have one Coulomb 
repulsion term and it is easier to go back to first principles. We can then use the correctly symmeterized 
wave functions to calculate the energies for specific electronic configurations of the helium atom. 

The energy of the electronic configuration {nih, Uih) is found by computing the expectation value 
of the Hamiltonian: 

^E)=j y^patial^ ^spatial d'Vid^rz. (8.8) 

The spin wave functions do not appear here because the Hamiltonian does not affect the spin directly, 
and so the spin wave functions just integrate out to unity. 

We start by re-writing the Hamiltonian given in eqn 8.7 in the following form: 

H = Hi+H2 + Hi2, (8.9) 

where 

Hi = -^V?--^, (8.10) 
2m 'Insori 

Hi2 = ^ ,. (8.11) 

Anso \ri - r2\ 

The energy can be split into three parts: 

E = Ei + E2 + Ei2, (8.12) 

where: 



and 



/ / <patial^i^spatialdVidV2 , (8.13) 
El2= [ I V':patial^l2V'spatialdVidV2 . (8.14) 
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The first two terms in cqn 8.12 represent the energies of the two electrons in tlie absence of the electron- 
electron repulsion. These are just equal to the hydrogenic energies of each electron: 

^r+^. = -^-^, (8.15) 

where the factor oi 4 = accounts for the nuclear charge. The third term is the electron-electron 
Coulomb repulsion energy: 

^12 = 1 J Cpatial V'spatial dVidVs . (8.16) 

The detailed evaluation of this integral for the correctly symmeterized wave functions given in eqn 8.4 is 
discussed in § 8.7. The end result is: 

Ei2 = Dab±Jab, (8.17) 

where the -|- sign is for singlets and the — sign is for triplets. Dab is the direct Coulomb energy given 
by: 



Dab = -, / u*A{ri)u*B{r2) — ^^(ri) ^^(ra) d-Vi d^^ra , (8.18) 

47reo J J ri2 

and Jab is the exchange Coulomb energy given by 

Jab = ^I [ UA{ri)u*B{r2)—UB{ri)uA{r2)d^rid^r2. (8.19) 

47reo J J ri2 

Note that in the exchange integral, we are integrating the expectation value of l/ri2 with each electron 
in a different shell. This is why it is called the "exchange" energy. The total energy of the configuration 
(niZi, 712/2) is thus given by: 

E{nih, 712^2) = f f+DAB±JAB, (8.20) 

where the + sign applies to singlet {S = 0) states and the — sign to triplets {S = 1). We thus see that 
the energies of the singlet and triplet states diff'er by 2 Jab- This splitting of the spin states is a direct 
consequence of the exchange symmetry. 
Note that: 

• The exchange splitting is not a small energy. It is part of the gross structure of the atom. This 
contrasts with the other spin-dependent effect that we have considered, namely the spin-orbit 
interaction, which is a small relativistic correction and only contributes to the "fine" structure of 
the atom. The value of 2 Jab for the first excited state of helium, namely the ls2s configuration, is 
0.80 eV. 

• We can give a simple physical reason why the symmetry of the spatial wave function (and hence 
the spin) affects the energy so much. If we put ri = r2 into eqn 8.4, we see that we get Vspatiai = 
for the anti-symmetric state. This means that the two electrons have a low probability of coming 
close together in the triplet state, and hence reduces their Coulomb repulsion energy. On the other 
hand, V'spatiai(''i = f2) 7^ for singlet states with symmetric spatial wave functions. They therefore 
have a larger Coulomb repulsion energy. 

• The exchange energy is sometimes written in the form 

A-Bexchange OC - J Si • . (8-21) 

This emphasizes the point that the change of energy is related to the relative alignment of the 
electron spins. If both spins are aligned, as they are in the triplet states, the energy goes down. If 
the spins are anti-parallel, the energy goes up. 

• The notation given in eqn 8.21 is extensively used when explaining the phenomenon of ferromag- 
netism in the subject of magnetism. The energy that induces the spins to align parallel to each 
other is caused by the spin-dependent change of the Coulomb repulsion energy of the electrons. 
The magnetic energy of the electrons due to the dipole-dipole interaction is completely negligible 
on this scale. 
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Figure 8.2: The ionization energies of helium atom. 



8.5 The helium term diagram 

The term diagram for helium can be worked out if we can evaluate the direct and exchange Coulomb 
energies. The total energy for each configuration is given by eqn 8.20. 

The ground state 

In the ground state both electrons are in the Is shell, and so we have a configuration of Is^. We have 
seen above that we can only have S = for this configuration. The energy is thus given by: 



The computation of the direct and exchange energies is non-trivial (to say the least) and keeps theoretical 
atomic physicists busy. The value of 29.8 eV given here can be deduced experimentally from the first 
ionization potential (see below). 

Ionization potentials 

The excited states are made by promoting one of the electrons to higher shells. When the second electron 
has been promoted into the energy continuum at n2 = oo, we are left with a singly ionized helium atom: 
He"*". This is now a hydrogenic system. We have one electron in the Is shell orbiting around a nucleus 
with charge +2e, and the energy is just —Z'^Rh = — 54.4eV. We thus deduce that the first ionization 
potential of helium is —54.4— (—79.0) = 24.6 eV. The second ionization potential (ie the energy required 
to liberate the second electron) is then equal to 54.4 eV. This point is illustrated in Fig. 8.2. 

Optical spectra 

The first few excited states of helium are listed in Table 8.4. We do not need to consider "two electron 
jump" excited states such as the 2s2s configuration here. This is because the Bohr model tells us that 
we need an energy of about 2 x \Rh to promote two electrons to the n = 2 shell. This is larger than the 
first ionization energy. 

For each excited state we have two spin states corresponding to S equal to or 1. The triplet S = \ 
terms are at lower energy than the singlets due to the exchange energy. (See eqn 8.17.) The AS* ~ 
selection rule tells us that we cannot get optical transitions between the singlets and triplet terms. The 
transitions involving singlet states have a normal Zeeman effect since 5 = 0, but the triplet transitions 
have an anomalous Zeeman effect since 5* 7^ 0. 

The energy term diagram for the first few excited states are shown in Fig. 8.3. The energy of the 
(Is, nl) state approaches the hydrogenic energy —Rujr? when n is large. This is because the excited 
electron is well outside the Is shell, which just partly screens the nuclear potential. The outer electron 
just sees Zcff — 1, and we have a hydrogenic potential. 




= -54.4eV-54.4eV + 29.8eV 



79.0 eV . 



(8.22) 



8.6. OPTICAL SPECTRA OF GROUP II ELEMENTS 
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Ground state 


Is Is (= ls2) 


First excited state 


Is 2s 


Second excited state 


Is 2p 


Third excited state 


Is 3s 


Fourth excited state 


Is 3p 


Ionization hmit 


Is ool 



Table 8.4: Electron configurations for the states of the helium atom. 
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Figure 8.3: Approximate energy term diagram for helium. The diagram is split into singlet and triplet 
states because only AS* = transitions are allowed by the selection rules. The energy difference between 
the singlet and triplet terms for the same configuration is caused by the exchange energy, as identified 
for the ls2s configuration. 



Excited states states such as the Is 2s configuration are said to be metastable. They cannot relax 
easily to the ground state. The relaxation would involve a 2s — > Is transition, which is forbidden by the 
Al = ±1 selection rule. Furthermore, the relaxation of the triplet Is 2s configuration is further forbidden 
by the AS* = selection rule. These states therefore have very long lifetimes. 



8.6 Optical spectra of group II elements 

The principles that we have been discussing here with respect to helium apply equally well to other 
two-electron atoms. In particular, they apply to the elements in group IIA of the periodic table (e.g. Be, 
Mg, Ca.) These atoms have two valence electrons in an s-shell outside a filled shell. The term diagram 
for group IIA elements would appear generically similar to Fig. 8.3, and the optical spectra would follow 
similar rules, with singlet and triplet transitions split by the exchange energy. The singlet and triplet 
transitions have normal and anomalous Zeeman effects, respectively. 
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8.7 Appendix: Detailed evaluation of the exchange integrals 

Our task is to evaluate the gross energy for a specific electronic configuration of helium. We restrict 

ourselves to configurations of the type (Is.n^), since these are the ones that give rise to the excited states 
that are observed in the optical spectra. From eqn 8.4 we see that spatial part of the wave function is 
given by: 

*(»'l,T'2) = ^(uis{ri)uni{r2)±Uni{ri)uu{r2)j 

where we take the + sign for singlets with 5 = and the — sign for triplets with S = 1. 
Our task is to evaluate the three terms in eqn 8.12. We first tackle Ei: 

El = J j ^* Hi^ d?rid?r2 

= \j j {<s{'ri)Ki{'r2)±Ki{ri)ul^{r2)) 

Hi (ul,{ri) uli{r2) ± <K^i) ^^(^2)) dVi dVs , 
where the + sign applies for singlet states and the — sign for triplets. This splits into four integrals: 

El = I, [ [ ul^{ri)uli{r2)HiUu{ri)uni{r2)d^rid^r2 



2 



1 /• 

'2 



u*ni{i"i)uis{r2)HiUni{ri)uu{r2)d'^rid'^r2 



±^ J j ul^{ri)uli{r2)HiUni{ri)uu{r2) d^rid^r2 

^\ j j Kii{ri)ul^{r2)HiUis{ri)uni{r2)d^rid^r2. 

We now use the fact that Uni{ri) is an eigenstate of Hi: 

Hi M„/(ri) = Eni Uniiri) , 
and that Hi has no effect on r2, to obtain: 

El = j ul^{ri)uu{ri)d^ri j <;(r2)u„((r2)dV2 

+ \Eni j u*^i{ri)uni{ri)d^ri j u\^{r2)uu{r2)d^r2 
^\Eni j ul^{ri)uni{ri) d^ri j M*;(r2)uis(r2)d^r2 
^\eis j uli{ri)uu{ri)d^ri j uts(r2)u„;(r2)d^r2 
= ^Eu + ^Eni+0 + 0. 

The integrals in the first two terms are unity because the u„; wave functions are normalized, while the 
last two terms are zero by orthogonality. 

The evaluation of E2 follows a similar procedure: 



E2 = j j ^*H2-^d^rid^r2 , 

= +\ j j Uuiri)uli{r2)H2Uu{ri)uni{r2)d^rid^r2 
+ \ J J Ki{ri)uls{r2)H2Uniiri)uuir2)d^rid^r2 
j j Uu{ri)u'^i{r2)H2Uniiri)uu{r2)d^rid^r2 
J J Kiiri)u*isii'2)H2Uni{ri)uu{r2)d^rid^r2 

= +^Er,l + ^Eu+0 + 0. 



8.7. APPENDIX: DETAILED EVALUATION OF THE EXCHANGE INTEGRALS 
Finally, we have to evaluate the Coulomb repulsion term: 

J J 47reori2 

("L(^i) <;(^2) ± <i(r-i) wL,(r2)) d-Vi d'Va , 
where again the + sign applies for singlet states and the — sign for triplets. The four terms are: 

1 f f 1 

Ei2 = +24^ J J Uisiri)u*ni{r2)—uu{ri)uni{r2)d^rid^r2 

1 f f 1 

+ 2^y J Ki{ri)ulM—iini{ri)uu{r2)d^rid^r2 

1 f f 1 

±ol / / "L(^iX;(^2) — w„;(^i)wis(^2)dVidV2 

^ 47reo J J ri2 

1 f f 1 

±2^y y <;(ri)wL(r2)— wis(ri)u„;(r2)dVidV2 

D D J J 

= H ^ ± - ± - , 

2 2 2 2' 

where D and J arc given by cqns 8.18 and 8.19 respectively. 
The total energy is thus given by 

E = Eu + Eni+D±J 

= -4:RH-4:RH/n'^+D±J, 

where the + sign applies to singlets and the — sign to triplets, (cf eqn 8.20 with ni = 1 and n2 
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CHAPTER 8. HELIUM AND EXCHANGE SYMMETRY 



Chapter 9 

Lasers I: Stimulated emission 



9.1 Introduction 

The word "LASER" is an acronym standing for "Light Amplification by Stimulated Emission of Radi- 
ation" . The origins of the laser may be traced back to Einstein's seminal papc;r on stimulated emission 
published in 1917, but it took until 1960 for the first laser to be invented. It is difficult to identify all of 
the key milestones in the history of laser physics, but here are a few of the more important ones: 

1917 Einstein's treatment of stimulated emission. 

1951 Development of the maser by C.H. Townes."'^ 

1958 Proposal by C.H. Townes and A.L. Schawlow that the maser concept could be extended to optical 
frequencies. 

1960 T.H. Maiman at Hughes Laboratories reports the first laser: the pulsed ruby laser. 

1961 The first continuous wave laser is reported: the helium neon laser. 

1962 Invention of the semiconductor laser. 

1964 Nicolay Basov, Charlie Townes and Aleksandr Prokhorov are awarded the Nobel prize for "funda- 
mental work in the field of quantum electronics, which has led to the construction of oscillators and 
amplifiers based on the maser-laser principle." 

1981 Art Schalow and Nicolaas Bloembcirgen are awarded the Nobel Prize for "their contribution to the 
development of laser spectroscopy." 

1997 Steven Chu, Claude Cohen- Tannoudji and William D. Phillips are awarded the Nobel Prize for the 

"development of methods to cool and trap atoms with laser light." 

2005 John Hall and Theodor Hansch receive the Nobel Prize for "their contributions to the development 
of laser-based precision spectroscopy, including the optical frequency comb technique" . 

2010 50th anniversary of the laser. 

Many different types of laser have been developed over the years. The "L" in "laser" stands for "Light", 
but light is understood here in a general sense to mean electromagnetic radiation with a frequency of 
^ 10^^-10^^ Hz, not specifically visible radiation. This provides the first general classification of laser 

types: 

• infrared, visible or ultraviolet wavelength. 
Other general classifications include: 

maser is basically the same as a laser, except that it works at microwave rather than optical frequencies. It took some 
years to move on from masers to lasers because microwave cavities are designed on the assumption that the cavity dimensions 
are comparable to the wavelength of the radiation within the cavity, which is typically around 10 cm at microwave frequencies. 
Such designs cannot be scaled easily to optical wavelengths, where A ~ 1 Jim, and it required some lateral thinking to design 
a cavity that would work in the regime where the cavity dimensions are much larger than the wavelength. It is only relatively 
recently that it has been possible to make "microcavity lasers" and "nanolasers" that have physical dimensions that are 
comparable to the wavelength of light. 
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• solid, liquid or gas gain medium; 

• continuous wave (CW) or pulsed operation; 

• fixed wavelength or tuneable wavelength. 

The gain medium (i.e. amplifying medium) of the laser determines the possible wavelengths that the 
laser can emit, but the characteristic propcirties of laser light are also strongly affected by the design of 
the cavity, which is the other essential part of a laser. Such properties include: 

Monochromaticity Discharge lamps emit light of many different colours simultaneously, according to 
the emission probabilities of the transitions in the atoms. Lasers, by contrast, emit light from just a 
single atomic transition, and are therefore highly monochromatic.^ The transition that is "selected" 
by the laser is determined by the amount of amplification that is available at that wavelength and 
the reflectivity of the mirrors that comprise the cavity. 

Directionality Discharge lamps emit in all directions, but lasers emit a well-defined beam in a specific 
direction. The direction of the beam is governed by the orientation of the mirrors in the cavity. 

Brightness The brightness of lasers arises from two factors. First, the radiation is emitted in a beam, 
which means that the intensity (i.e. power per unit area) can be very high, even though the 
total amount of power is relatively low. Second, all of the energy is concentrated within the narrow 
spectrum of a single atomic transition. This means that the spectral brightness (i.e. the intensity 
in the beam divided by the spectral width of the emission line) is even higher in comparison with 
a white-light source such as an incandescent light bulb. For example, the spectral brightness of a 
1 mW laser beam could easily be millions of time greater than that of a 100 W light bulb. 

Coherence Lasers have a high degree of both spatial and temporal coherence. The coherence of laser 
light will be considered in more detail in Section 10.3. 

These four properties are common to all lasers. In addition, some lasers emit radiation in very short pulses, 

which can be used for studying fast processes in physics, chemistry, and biology, or for transmitting optical 
data at a very high rate down optical fibres. The principles that govern whether a laser can produce very 
short pulses are considered in Section. 10.2.2. 

9.2 Principles of laser oscillation 

As mentioned above, the word "LASER" is an acronym that stands for "light amplification by stimulated 
emission of radiation" . However, there is more to a laser than just light amplification. A laser is actually 
an oscillator rather than just an amplifier.-^ The difference is that an oscillator has positive feedback in 
addition to amplification. The key ingredients of a laser may thus be summarized as: 

LASER = liglit amplification + positive optical feedback . 

Light amplification is achieved by stimulated emission. Ordinary optical materials do not amplify 
light. Instead, they tend to absorb or scatter the light, so that the light intensity out of the medium 
is less than the intensity that went in. To get amplification you have to drive the material into a non- 
equilibrium state by pumping energy into it. The amplification of the medium is determined by the gain 
coefficient 7, which is defined by the following equation: 

I{x + dx) = I{x) + jl{x)dx = I{x) + dl , (9.1) 

where I{x) represents the intensity at a point x within the gain medium. The differential equation can 
be solved as follows: 

7/dx 
7/ 

/(0)e^=^ . (9.2) 

^ "Monochromatic" means "single coloured" . 

more accurate acronym for a laser might therefore be "LOSER", but it is easy to understand why this one never 
caught on. 



dl = 
d/ 
■ • d^ 
.-.lix) = 
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Figure 9.1: Schematic diagram of a laser 



Thus the intensity grows exponentially within the gain medium. 

Positive optical feedback is achieved by inserting the amplifying medium inside a resonant cavity. 
The effects of the cavity on the properties of laser light will be considered in detail in Sections 10.1-10.3. 
At this stage, we confine ourselves to considering the parameters of the cavity that affect the condition 
for laser oscillation. 

Figure 9.1 shows a schematic diagram of a laser. Light in the cavity passes through the gain medium 
and is amplified. It then bounces off the end mirrors and passes through the gain medium again, getting 
amplified further. This process repeats itself until a stable equilibrium condition is achieved when the 
total round trip gain balances all the losses in the cavity. Under these conditions the laser will oscillate. 
The condition for oscillation is thus: 

round-trip gain = round-trip loss 

The losses in the cavity fall into two categories: useful, and useless. The useful loss comes from the 
output coupling. One of the mirrors (called the output coupler) has reflectivity less than unity, and 
allows some of the light oscillating around the cavity to be transmitted as the output of the laser. The 
value of the transmission is chosen to maximize the output power. If the transmission is too low, very 
little of the light inside the cavity can escape, and thus we get very little output power. On the other 
hand, if the transmission is too high, there may not be enough gain to sustain oscillation, and there would 
be no output power. The optimum value is somewhere between these two extremes. Useless losses arise 
from absorption in the optical components (including the laser medium), scattering, and the imperfect 
reflectivity of the other mirror (the high reflector) . By taking into account the fact that the light passes 
twice through the gain medium during a round trip, the condition for oscillation in a laser can be written: 

e2^'i?oci?HRL= 1, (9.3) 

where I is the length of the gain medium, Rqc is the reflectivity of the output coupler, i?HR is the 
reflectivity of the high reflector, and L is the round-trip loss factor due to absorption and scattering, such 
that L = 1 corresponds to the situation with no losses. If the total round-trip losses are small (< 10%), 
then the gain required to sustain lasing will also be small, and eqn 9.3 simplifies to: 

2jl = (1 — ^oc) + (1 ^ -Rhr) + scattering losses -I- absorption losses . (9.4) 

This shows more clearly how the gain in the laser medium must exactly balance the losses in the cavity. 

In general we expect the gain to increase as we pump more energy into the laser medium. At low 
pump powers, the gain will be small, and there will be insufficient gain to reach the oscillation condition. 
The laser will not start to oscillate until there is enough gain to overcome all of the losses. This implies 
that the laser will have a threshold in terms of the pump power. (See Section 9.6.) 



9.3 Stimulated emission 

In Chapter 3, we considered the spontaneous tendency for atoms in excited states to emit radiation. We 
now consider the optical transitions that occur when the atom is subjected to electromagnetic radiation 
with its frequency resonant with the energy difference of the two levels. We follow the treatment of 
Einstein (1917). 

In addition to transitions from the upper to the lower level due to spontaneous emission, there will 
also be: 

• absorption of photons causing transitions from level 1 up to level 2; 
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• stimulated emission in which atoms in level 2 drop to level 1 induced by the incident radiation. 

The process of stimulated emission is a coherent quantum-mechanical effect. The photons emitted by 
stimulated emission are in phase with the photons that induce the transition. This is the fundamental 
basis of laser operation, as the name suggests: Light Amplified by Stimulated Emission of Radiation. 

Consider an atom irradiated by white light, with N2 atoms in level 2 and iVi atoms in level 1. 
The part of spectrum at frequency where hv — {E2 — Ei), can induce absorption and stimulated 
emission transitions. We write the spectral energy density of the light at frequency v as u(iy). The 
transitions that can occur are shown in Fig. 9.2. In order to treat this situation, Einstein introduced 
his A and B coefficients. We have already seen in Section 3.6 that the A coefficient determines the 
rates of spontaneous transitions. The introduction of the B coefficient extends the treatment to include 
absorption and stimulated emission. The transition rates for three processes are: 



Spontaneous emission (2 — > 1): 



Stimulated emission (2 — ^ 1): 



diV2 _ dA^i 
"df ~ dT 

dN2 _ dNi _ 
"dT ^ "dT ~ 



= -A21N2 . (9.5) 



~B2iN2u{iy) . (9.6) 



• Absorption (1 — >• 2): 



dNi _ dN2 

"dT ^ ^^df 



= -B^2Niu{v) . (9.7) 



These are effectively the definitions of the Einstein A and B coefficients. At this stage we might be 
inclined to think that the three coefficients are independent parameters for a particular transition. This 
is not in fact the case. If you know one, you can work out the other two. To see this, we follow Einstein's 
analysis. 

We imagine a gas of atoms inside a box at temperature T with black walls. If we leave the atoms 
for long enough, they will come to equilibrium with the black-body radiation that fills the cavity. In 
these steady-state conditions, the rate of upward transitions must exactly balance the rate of downward 
transitions: 

Bi2Niu{v) = A21N2 + B2iN2u{v) , (9.8) 

which implies that: 

Ni A21 + B2iu{v) 

Furthermore, since the gas is in thermal equilibrium at temperature T, the ratio of N2 to A^i must satisfy 
Boltzmann's law: 

— — exp -— - , (9.10) 
iVi gi \ k^T J 

where 52 and gi are the degeneracies of levels 2 and 1 respectively, and hv = {E2 — Ex). Equations 9.9 
and 9.10 together imply that: 

''^^"^'^^ -^expf-/^V (9.11) 



A21 + B2iu{v) gi \ ksT 
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On solving this for u{i'), we find: 

''^''^"5ii3i2exp(WW-52i?2i' ^ ^ 

However, we know that the cavity is filled with black-body radiation, which has a spectral energy density 
given by the Planck formula:* 

, , Snhu^ 1 , „x 

"^'^^^ c3 exp(WfcBT)-l - ^'-''^ 

The only way to make eqns 9.12 and 9.13 to be consistent with each other at all temperatures and 
frequencies is if: 

51-812 = 32-621 , 

8nhu^ ^ (9-14) 
^21 = ^ — -D21 • 

A moment's thought will convince us that it is not possible to get consistency between the equations 
without the stimulated emission term. This is what led Einstein to introduce the concept. 

The relationships between the Einstein coefficients given in eqn 9.14 have been derived for the case 
of an atom in thermal equilibrium with black-body radiation at temperature T. However, once we have 
derived the inter-relationships, they will apply in all other cases as well. This is very useful, since we 
only nc!cd to know one of the coefficients to work out the other two. For example, we can measure the 
radiative lifetime to determine ^21 from (cf. eqn 3.25), 

^21 = - , (9.15) 
r 

and then work out the B coefficients from eqn 9.14. 

Equation 9.14 shows that the probabilities for absorption and emission arc the same apart from 
the degeneracy factors, and that the ratio of the probability for spontaneous emission to stimulated 
emission increases in proportion to v^. In a laser we want to encourage stimulated emission and suppress 
spontaneous emission. Hence it gets progressively more difficult to make lasers work as the frequency 
increases, all other things being equal. 



9.4 Population inversion 

We have seen above that stimulated emission is the basis of laser operation. We now wish to study how 
we can use stimulated emission to make a light amplifier. In a gas of atoms in thermal equilibrium, the 
population of the lower level will always be greater than the population of the upper level. (See eqn 9.10). 
Therefore, if a light beam is incident on the medium, there will always be more upward transitions due 
to absorption than downward transitions due to stimulated emission. Hence there will be net absorption, 
and the intensity of the beam will diminish on progressing through the medium. 

To amplify the beam, we require that the rate of stimulated emission transitions exceeds the rate of 
absorption.^ We see from eqns 9.6 and 9.7 that this condition is satisfied when: 

B2iN2u{u) > Bi2Niu{v) . (9.16) 

On substituting from eqn 9.14, this leads to the conclusion: 

N2 > —Ni , (9.17) 
51 

which simplifies to: 

N2>Ni, (9.18) 

for non-degenerate levels. This is a highly non-equilibrium situation, and is called population inversion. 
On comparing eqn 9.17 to eqn 9.10, we see that population inversion corresponds to negative temperatures. 
This is not as ridiculous as it sounds, because the atoms are not in thermal equilibrium. 

Once we have population inversion, we have a mechanism for generating gain in the laser medium. 
The art of making a laser is to work out how to get population inversion for the relevant transition. We 
will discuss how this is done for specific types of laser in Sections 10.5-10.6. 

*In a medium with refractive index n, c gets replaced hy v = c/n in eqn 9.13. 

^We can usually ignore spontaneous emission in an operating laser because we axe considering the case in which the light 
intensity is very high, so that the stimulated emission rate far exceeds the spontaneous emission rate. 
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Figure 9.3: (a) Relationship between the intensity / and energy density Ui, of a light beam, (b) Incremental 
intensity increase in a gain medium. 

9.5 Gain coefficient 



Having realized that population inversion gives rise to amplification of light, we now want to work out 
a relationship between the gain coefficient and the population inversion density. Before we can proceed, 
we must first refine our analysis of the absorption and stimulated emission rates. Einstein's analysis 
considered the interaction of an ideal atom with a featureless white-light spectrum. In practice, we are 
more interested in the interaction of real atoms with sharp emission lines with an even narrower band of 
light that will eventually become the laser mode. 

The energy density u{v) that appears in eqns 9.6-9.13 is the spectral energy density (units: J m^'' Hz^^ 
= Jsm^'^). We now consider the interaction between an atom with a normalized line shape function g{i') 
as defined in Section 3.7 (units: Hz"-'^) and a beam of light whose emission spectrum is much narrower 
than the spectral line width of the atomic transition. This situation is considered in Appendix 9.9, where 
it is shown that the rates of absorption and stimulated emission can be written respectively as: (cf. 
eqns 9.36 and 9.37 with the subscript on the laser frequency omitted): 



Wi2 = Bi2Niu^g{iy) 
W21 = B2iN2u^g{v) 



(9.19) 



The light source is considered to have a delta function spectrum at frequency v with total energy density 
Ui, per unit volume (units Jm~'^). Ui, is related to the intensity / of the optical beam by (see Fig. 9.3(a)): 



c 
n 



(9.20) 



where n is the refractive index of the medium. This means that the net stimulated rate downwards from 
level 2 to level 1 is given by: 



W^^' = W21 - W12 = {N2 - N,)B2ig{iy)-I , 



(9.21) 



where we have assumed that the levels are non-degenerate so that B12 — i?2i-^ (See eqn 9.14.) 

For each net transition a photon of energy hv is added to the beam. The energy added to a unit 
volume of beam per unit time is thus Consider a small increment of the light beam inside the 

gain medium with length da;, as shown in Fig. 9.3(b). The energy added to this increment of beam per 
unit time is W2i^hi> x Ada;, where A is the beam area. On remembering that the intensity equals the 
energy per unit time per unit area, we can write: 



dl = W2Thi^dx, 

Tl 

= {N2 - Ni)B2igiiy)-Ihi^ dx . 



(9.22) 



On comparing this to eqn 9.2, we see that the gain coefficient 7 is given by: 

Tl 

7(z/) = {N2 - Ni)B2ig{y)-hu . 



(9.23) 



This result shows that the gain is directly proportional to the population inversion density, and also 
follows the spectrum of the emission line. By using eqn 9.14 to express B21 in terms of A21, we can 



^For non-degenerate levels, the population inversion density AA'' is equal to {N2 — (92/91)^1) rather than just {N2 — Ni). 



9.6. LASER THRESHOLD 
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Figure 9.4: Level scheme for a four-level laser. 

re-write the gain coefficient in terms of the natural radiative lifetime r using eqn 9.15 to obtain: 

A2 



(9.24) 



where A is the vacuum wavelength of the emission line. This is the required result. Equation 9.24 tells us 
how to relate the gain in the medium to the population inversion density using experimentally measurable 
parameters: A, r, n and g{v). 



9.6 Laser threshold 

Equation 9.24 shows us that the gain in a laser medium is directly proportional to the population inversion 
density. Laser operation will occur when there is enough gain to overcome the losses in the cavity. This 
implies that a minimum amount of population inversion must be obtained before the laser will oscillate. 
Population inversion is achieved by "pumping" atoms into the upper laser level. This pumping can be 
done by a variety of techniques, which will be described in more detail when we consider individual laser 
systems in Chapter 10. At this stage we just consider the general principles. 

Lasers are classified as being either three-level of four-level systems. We will consider four-level 
lasers first, as these are the most common. Examples are Helium Neon or Nd:YAG. The four levels are: 
the ground state (0) , the two lasing levels (1 & 2 ) , and a fourth level (3) which is used as part of the 
pumping mechanism. The level scheme for an ideal four-level laser is shown in Fig. 9.4. The feature that 
makes it a four-level as opposed to a three-level laser is that the lower laser level is at an energy more 
than k^T above the ground state. This means that the thermal population of level 1 is negligible, and 
so level 1 is empty before we turn on the pumping mechanism. 

We assume that the atoms are inside a cavity and are being pumped into the upper laser level (level 
2) at a constant rate of M2 • This is usually done by exiting atoms to level 3 with a bright flash lamp or 
by an electrical discharge, and then by a rapid decay to level 2. We can write down the following rate 
equations for the populations of levels 1 and 2: 



dN2 
dt 

"dT 



N2 

T2 
T2 



- W{ 



The various terms allow for: 



/-net 1 








/-net 




12 













(9.25) 



• stimulated transitions from level 2 to level 1 (iVF^ff ), 

• pumping into level 2 (M2), 

• decay from level 1 to the ground state by radiative transitions and/or collisions (Ni/ti). 

Note that W"2i°* is the net stimulated transition rate from level 2 to level 1, as given in eqn 9.21. This is 
equal to the rate of stimulated emission transitions downwards minus the rate of stimulated absorption 
transitions upwards. 

There are two important assumptions implicitly contained in eqn 9.25: 
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Figure 9.5: (a) Variation of the gain and population inversion in a laser with the pumping rate, (b) 
Comparison of the threshold and light outputs for two different values of the transmission of the output 
coupler. 



1. There is no pumping into level 1. 

2. The only decay route from level 2 is by radiative transitions to level 1 (i.e. there are no non-radiative 
transitions between level 2 and level 1, and transitions to other levels are not possible). 

It may not always be possible to satisfy these assumptions, but it helps if we can. That is why we 
described the above scenario as an "ideal" four-level laser. 
We can re-write eqn 9.21 in the following form: 

W^f = B2ig{v)-I{N2 -Ni)= W{N2 - N^) , (9.26) 
c 

where W = B2ig{v)nl / and / is the intensity inside the laser cavity. In steady-state conditions, the 
time derivatives in eqn 9.25 must be zero. We can thus solve eqn 9.25 for Ni and N2 using eqn 9.26 to 
obtain: 



N2 

Therefore the population inversion is given by 



WNi + R2 (9-27) 



W + 1/t2 



AN^N2~N,^ ' (l--) . (9.28) 

W + I/T2 V T2 J 

This shows that the population inversion is directly proportional to the pumping rate into the upper 
level. Note, however, that it is not possible to achieve population inversion (i.e. AiV > 0) unless T2 > ti. 
This makes sense if you think about it. Unless the lower laser level empties quickly, atoms will pile up in 
the lower laser level and this will destroy the population inversion. 
Equation 9.28 can be re- written as : 

R 

AN = — , (9.29) 

where M = M2(l ~ '''i/'''2)- This is the net pumping rate after allowing for the unavoidable accumulation 
of atoms in the lower level because ti is non-zero. If the laser is below the threshold for lasing, there 
will be very few photons in the cavity. Therefore, W will be very small because / is very small: see 
eqn 9.26 above. The population inversion is simply Mt2, and thus increases linearly with the pumping 
rate. Equation 9.24 implies that the gain coefficient similarly increases linearly with the pumping rate 
below threshold. 

Eventually we will have enough gain to balance the round trip losses. This determines the threshold 
gain coefficient 7**^ for laser oscillation, as set out in eqn 9.3 or 9.4. From eqn 9.24 we have: 

^^,,^Wt2 (9.30) 

By combining eqns 9.28 and 9.29 with W = we can work out the pumping rate required to instigate 
lasing. This is the threshold pumping rate. It is given by M*'^ = AA^*'yT2. All lasers have a threshold. 
Unless you pump them hard enough, they will not work. 



9.7. PULSED LASERS 
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Figure 9.6: Level scheme for a three-level laser, for example: ruby. 



What happens if we increase the pumping rate beyond the threshold value? In steady-state conditions, 
the gain cannot increase any more, which implies that the population inversion is clamped at the value 
given by eqn 9.30 even when M exceeds R}^. This is shown in Fig. 9.5(a). What about the power output? 
We set W to zero in eqn 9.29 because there was very little light in the cavity below threshold. This is no 
longer true once the laser starts oscillating. If we are above threshold, AA^ is clamped at the value set 
by eqn 9.30, and so eqn 9.29 tells us that: 

W=x^-- = -fJr-l) ••• forM>M'h. (9.31) 

Now W is proportional to the intensity I inside the cavity (see eqn 9.26), which in turn is proportional 
to the output power P°"* emitted by the laser. Thus P°"* is proportional to W, and we may write: 

M 

Pth 



1 ... for M > M*'' . (9.32) 



This shows that the output power increases linearly with the pumping rate once the threshold has been 
achieved, as shown in Fig. 9.5(b). 

The choice of the reflectivity of the output coupler affects the threshold because it determines the 
oscillation conditions: see eqn 9.3 or 9.4. If the output coupler transmission (1 — Rqc) is small, the laser 
will have a low threshold, but the output coupling efficiency will be low. By increasing the transmis- 
sion, the threshold increases, but the power is coupled out more efficiently. This point is illustrated in 
Fig. 9.5(b). The final choice for Roc depends on how much pump energy is available, which will govern 
the optimal choice to get the maximum output power. 



9.7 Pulsed Lasers 

So far we have only considered continuous wave (CW) lasers, but many lasers in fact operate in a pulsed 
mode. Powerful pulsed flash lamps can give rise to very large pumping rates, with correspondingly large 
output pulse energies, especially when using a trick called "Q-switching" . In this technique, the losses in 
the cavity are kept artificially high by some external method.^ This prevents lasing and allows the build 
up of very large population inversion densities, with correspondingly large gain coefficients. If the losses 
are suddenly reduced, a very powerful pulse will build up because of the very high gain in the cavity. 
Q-switching is widely used in solid-state lasers because they tend to have long upper state lifetimes, which 
allows the storage of a large amount of energy in the crystal, but it is seldom used in gas lasers because 
the lifetimes are shorter which makes it difficult to store much energy in the gain medium. 



9.8 Three-level lasers 

Some lasers are classified as being three-level systems. The standard example is ruby, which was the 
first laser ever produced. The key difference between a three-level laser and a four-level laser is that the 
lower laser level is the ground state, as shown in Fig. 9.6. On comparing Figs. 9.4 and 9.6, it is apparent 
that the lower laser level of the four-level system has merged with the ground state in the three-level 
system. This makes it much more difficult to obtain population inversion in three-level lasers because 



^Onc way to switch the cavity losses from high to low on fast timescales is to use an electro-optical modulator. This 
effectively behaves like a fast intra-cavity shutter. 
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laser 




Figure 9.7: Interaction of an atomic transition with: (a) broad-band radiation, and (b) narrow-band 
radiation. Note that the spectral energy densities and the atomic hne-shape functions are not drawn on 
the same vertical scales. 

the lower laser level initially has a very large population. Let this population be Nq. By turning on the 
pump, we excite dA^ atoms to level 3, which then decay to level 2. Thus the population of level 2 will be 
dA^, while the population of the lower laser level will be (A^o — dA^). Hence for population inversion we 
require dA^ > (A'o — dA^), that is dA^ > No/2. Therefore, in order to obtain population inversion we have 
to pump more than half of the atoms out of the ground state into the upper laser level. This obviously 
requires a very large amount of energy, which contrasts with four-level lasers, where the lower laser level 
is empty before the pumping process starts, and much less energy is required to reach threshold. 

Despite the fact that the threshold for population inversion is very high in a three-level system, they 
can be quite efficient once this threshold is overcome. Ruby lasers pumped by bright flash lamps actually 
give very high output pulse energies. However, they only work in pulsed mode. Continuous lasers tend 
to be made by using four-level systems. 



9.9 Appendix: Interaction with narrow-band radiation 

The Einstein B coefficients were introduced to consider the interaction of atoms with broad-band radi- 
ation, such as black-body radiation, as illustrated in Fig. 9.7(a). In this situation, the spectral energy 
density u{v) varies much more slowly with frequency i> than the atomic lineshape function g{v), and 
may effectively be taken as constant over the line width of the transition. In a laser, by contrast, the 
spectral width of the radiation inside the cavity is frequently much narrower than the width of the atomic 
transition, as illustrated in Fig. 9.7(b). 

Th absorption and stimulated emission transition rates for the case of narrow-band radiation, as shown 
in Fig. 9.7(b), can be calculated as follows. The spectral hne-shape function g{i')dv gives the probability 
that a particular atom will absorb or emit in the spectral range v ^ v + dv. Hence the number of atoms 
in the lower level per unit volume that can absorb radiation in this frequency range is Nig{i')diy. From 
the definition of the Einstein B12 coefhcient given in eqn 9.7, the absorption rate in this frequency range 
is therefore: 

dWi2 = B12 Nig{u)dv u{v) . (9.33) 

The total absorption rate is thus: 

POO 

Wi2^ Bi2Nig{v)u{v)dv . (9.34) 
Jo 

Since the spectral energy density of the radiation inside the laser cavity is much narrower than the width 
of the atomic transition, we can write it as: 

U{v) = UySlu - fc'lasor) , (9.35) 

where it^ is the total energy density of the beam (cf eqn 9.20) and 5{v) is the Dirac delta function. * On 

*The Dirac delta function 8(x — xq) takes the value of at all values of x apart from xq, and is normalized such that 
/q°° 6{x — XQ)dx = 1. It can be thought of as the limit of a top-hat function of width A and height 1/A centred at xq in 
the limit where A — >■ 0. It is easy to show that 



/ f{x)S{x-xo)dx = f{xo) 
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inserting this into eqn 9.34, we obtain: 



Wi2 



/ Bi2Nig{v)u^5{v - l^laser) df . 

Jo 



= Bi2Nig{l^i^sei)Ui' ■ 



(9.36) 



The argument for the stimulated emission rate follows similarly, and leads to: 
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Chapter 10 

Lasers II: Cavities and examples 



In Chapter 9 we pointed out that a laser works by combining an ampUfying medium with a resonant 
cavity. In this chapter we study how the cavity affects the properties of the light emitted by the laser, 
and then look at a few examples of important lasers, paying particular attention to the mechanism that 
produces population inversion. 



10.1 Laser cavities 

The cavity is an essential part of a laser. It provides the positive feedback that turns an amplifier into 
an oscillator, and determines the properties of the beam of light that is emitted, as shown schematically 
in Fig. 10.1. This beam is characterized by its transverse and longitudinal mode structure, which are 
considered separately below. 



10.1.1 Transverse modes 

The transverse modes of a laser beam describe the variation of the electrical field across a cross-sectional 
slice of the beam. The modes are labelled TEMmn where m and n are integers. TEM stands for 
"transverse electro magnetic". If the field is propagating in the z direction, the {x,y) dependence of the 
field is given by: 



\/2a; \ / V2y 



£n^n{x, y) = SoHrn j \w) V J ' ^^^'^^ 

where w is the beam waist parameter that determines the size of the beam, and Hm and iJ„ are 
mathematical functions called the Hermite polynomials.^ The first few polynomials are: 

Hoiu) = 1, 

Hi[u)^2u, (10.2) 
H2{u) = 2{2u^ - 1). 

The most important mode is the TEMqo mode. This has a Gaussian radial distribution: 

^m{x,y) = foexp — = £oCxp ^ , (10.3) 



^Hermite polynomials also appear in the solution of the Schrodinger equation for a simple harmonic oscillator. 
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Figure 10.1: Laser cavity and output beam 
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Figure 10.2: (a) Intensity distribution of a TEMqo mode, which has a Gaussian profile, (b) Beam profiles 
produced by various higher-order laser modes. Note that the side lobes in the x direction for the TEM21 
mode are too faint to be seen on this grey scale. 





Diode laser 


HeNe laser 


Argon ion laser 


L 

n 

Mode spacing 


1 mm 
3.5 
150 GHz 


30 cm 
1 

500 MHz 


2m 
1 

75 MHz 



Table 10.1: Mode spacing for several common laser 



where r is the distance from the centre of the beam, as shown in Fig. 10.2(a). The TEMqo mode is the 
closest thing to a ray of light found in nature. It has the smallest divergence of all the modes and can be 
focussed to the smallest size. We therefore usually try hard to prevent the other modes from oscillating. 
This is achieved by inserting apertures in the cavity which are lossy for the higher-order modes but not 
for the smaller TEMqo mode. 

Figure 10.2(b) compares the beam cross-section for a number of higher-order laser modes with that 
of the TEMoo mode. Note that the TEM,„„ mode has m nodes (zeros) in the x direction and n nodes 
in the y direction. These higher-order modes make pretty pictures, but are not useful for very much. A 
well-designed laser will contain apertures that allow only the TEMoo mode to oscillate. 



10.1.2 Longitudinal modes 

The longitudinal modes determine the emission spectrum of the laser. The light bouncing repeatedly off 
the end mirrors sets up standing waves inside the cavity, as shown in Fig. 10.1. There are nodes (field 
zeros) at the mirrors because they have high reflectivities. Thus there must be an integer number of half 
wavelengths inside the cavity. If the length of the cavity is L, this condition can be written: 

A c 

L = integer x — = integer x , (10-4) 

2 2niy 

where n is the average refractive index of the cavity. In gas lasers, n will be very close to unity. It will 
also be the case that n 1 in a solid-state laser with a short laser rod inside a long air-filled cavity. 
Equation 10.4 implies that only certain frequencies that satisfy: 

c 

v = integer x - — - (10.5) 
2nL 

will oscillate. Most cavities are much larger than the optical wavelength and thus the value of the integer 
in equations 10.4 and 10.5 is very large. ^ The most important parameter is the longitudinal mode spacing: 

Afmodo = 7r~F ■ (10.6) 

2nL 

Table 10.1 lists the longitudinal mode spacing for several lasers. 

^ This is not true for "microcavity lasers" , where we deliberately make the cavity to be of similar dimensions to the 
optical wavelength. In this case the integer would have a value of unity. The use of microcavity semiconductor lasers is 
now widespread in optical fibre systems. 
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Figure 10.3: Multi-mode, single-mode and mode-locked operation 



10.2 Single-mode, multi-mode, and mode-locked lasers 

The gain bandwidth of a laser medium will usually be much wider than the spacing of the longitudinal 
modes of the cavity. This leads to a number of ways of operating the laser. 



10.2.1 Multi-mode and single-mode lasers 

For a given longitudinal mode to oscillate, its frequency must lie within the emission spectrum of the 
laser transition. Unless we do something about it, there will be a tendency for all the longitudinal modes 
that experience gain to oscillate. Therefore the laser will have multi-mode operation, as illustrated in 
Fig. 10.3. As a rough guide, the number of modes that will be oscillating is equal to the gain bandwidth 
divided by the mode spacing. Thus for a 30 cm HeNe laser with a gain bandwidth of 1.5 GHz, there will 
be three modes oscillating. In a Doppler-broadened emission line such as that from the Neon atoms in a 
HeNe laser, the phases of these modes will be random relative to each other because they are emitted by 
different atoms. 

When a laser runs in multi-mode operation, its spectral bandwidth is not significantly smaller than 
that of the light emitted from the same transition in a discharge lamp. For many applications (e.g. 
supermarket bar-code readers), this is not an issue. However, for some others, it is. An obvious example is 
high-resolution spectroscopy. Other examples include those that rely on having high temporal coherence, 
for example: holography and interferometry. This follows from the fact that the temporal coherence is 
inversely proportional to the spectral bandwidth. (See Section 10.3 below.) It is therefore interesting to 
see if we can make the laser run on just a single longitudinal mode. The spectral linewidth would then 
be determined by the quality factor (Q) of the cavity rather than the gain band width. This is called 
single-mode operation. 

One way to achieve to single-mode operation is to shorten the cavity so that the mode spacing exceeds 
the gain bandwidth. See Fig. 10.3. In this case only one mode will fall within the emission line of the 
transition and the laser will automatically oscillate on only one mode. However, this may not be practical. 
For example, in the case of the HeNe laser considered above, we would need to make the cavity shorter 
than 10 cm. Such a laser would have very small round-trip gain, and we would probably not be able to 
make it oscillate. A better way to obtain single mode operation is to insert a narrow frequency filter in 
the cavity such as a Fabry-Perot etalon. By tuning the spacing of the etalon, the frequency of the single 
mode can be changed continuously, which is very useful for high-resolution spectroscopy. The spectral 
line width of a single-mode laser is typically a few MHz. This is about a thousand times narrower than 
the atomic emission line that produces the light. 
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Figure 10.4: Mode- locked laser pulses from a cavity with n — 1. 



10.2.2 Mode locking 



Lasers can be made to operate continuously or in pulses. The length of the pulse might be determined, 
for example, by the duration of the flash-lamp pulse that produces the population inversion, or by the 
properties of a Q-switch. (See Section 9.7.) The pulses produced in this way are relatively long, e.g, 
tens of nanoseconds at best. However, there is another technique called mode locking that leads to the 
emission of a continuous train of very short ("ultrashort") pulses. This is the technique we consider here. 

Mode locking is the opposite extreme to single-mode operation. In a mode-locked laser we try to 
get as many longitudinal modes oscillating as possible, but with all their phases locked together. (See 
Fig. 10.3). This contrasts with a multi-mode laser in which many modes are oscillating but with random 
phases with respect to each other. 

In Appendix 10.7 we prove that the mode-locked operation of a laser corresponds to a single pulse 
oscillating around the cavity and getting emitted every time it hits the output coupler, as shown in 
Fig. 10.4. The time taken for a pulse to circulate around a cavity of length L with n = 1 is 2L/c. 
Therefore we get pulses out of the laser at a repetition rate of {2L/c)^^. 

The minimum pulse duration is set by the Fourier transform of the gain spectrum: 



where Ai/ is the gain bandwidth. This "uncertainty principle" means that to get very short pulses we 
need a wide gain bandwidth. Gas lasers are not very good in this context because they are based on 
fairly narrow atomic transitions. For example, the bandwidth of the 632.8 nm line in the HeNe laser is 
1.5 GHz (see Table 10.2), and so the pulses that can be produced must be at least 0.11ns long. 

The best results have been achieved in tuneable lasers such as dye lasers or titanium-doped sapphire 
lasers. The gain bandwidth of the Ti:sapphire laser is nearly 10^"* Hz, and mode-locked Ti:saphhire lasers 
routinely produce pulses shorter than 100 fs (1 fs = 10~^^ s), which corresponds to millions of longitudinal 
modes oscillating. When the full gain bandwidth of the crystal is used, pulses shorter than 1 fs have been 
produced from this laser. 

Mode locking is achieved by two main techniques. With active mode locking, a time-dependent shutter 
is inserted in the cavity.^ The shutter is opened briefly every 2L/c seconds. Continuous operation of the 
laser is impossible, but the mode-locked pulses will be unaffected by the shutter. In passive mode locking, 
a saturable absorber is inserted in the cavity. Such absorbers have strong absorption at low powers and 
small absorption at high powers. The peak power in the pulsed mode is much higher than in continuous 
operation, and thus the cavity naturally selects the pulsed mode. 

Mode-locked lasers are widely used in scientific research to study fast processes in physics, chemistry, 
and biology. For example, the typical time for a current-carrying electron in a copper wire to interact with 
a phonon at room temperature is about 100 fs. Similarly, the early stages of many chemical reactions or 
biological processes such as photosynthesis take place in less than lO"^'^ s. Another widespread application 
of short pulse lasers in biology is in microscopy. It is common practice to obtain images of biological 
molecules by tagging them with fluorescent chromophores (e.g. dyes, quantum dots) and then exciting 
the sample with a laser in a confocal microscopy. The use of mode-locked lasers gives far superior depth 
resolution compared to continuous wave (CW) lasers.^ 

Mode-locked lasers are also useful to telecommunication companies, who are interested in packing 
as many bits of information (represented by pulses of light) as possible down their optical fibres. The 
shorter the pulses, the higher the data rate. There are also medical applications: it is much cleaner to 

■^The time-dependent shutter is typically made by using a high speed acousto-optic modulator. 

^When a mode-locked laser is used, the higher peak power allows the fluorescent chromophore to be excited by two- 
photon absorption. The power is only high enough for this to occur at the focus of the laser, and so only the part of the 
sample at the focus produces light. With a CW laser, by contrast, the chromophores are excited by standard one-photon 
absorption, and the whole depth of the sample emits light. 



AUi^Au > l/27r , 



(10.7) 
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SOURCE 


Spectral line width 
Ai/ (Hz) 


CohcreiiCG time 

tc (S) 


Coherence length 

Ic 


Sodium discharge lamp (D-lines at 589 nm) 

Multi-mode HcNe laser (632. 8nm line) 
Single-mode HeNe laser (632.8 nm line) 


5 X IQii 
1.5 X 10^ 
1 X 10^ 


2 X IQ-i^ 

6 X 10-1° 
10-6 


0.6 mm 

20 cm 
300 m 



Table 10.2: Coherence length of several light sources. 



use a very short, low-energy, high-peak-power pulse for laser surgery, than a longer pulse with the same 
peak power but much higher energy. 



10.3 Coherence of laser light 

As mentioned in Section 9.1, laser light has a high degree of both spatial and temporal coherence. The 
spatial coherence is related to the phase uniformity across a cross-sectional slice of the beam. When the 

laser is running in a wcU-dcfincd transverse mode, the optical phase across such a slice will be constant. 
Hence the spatial coherence follows from the transverse modes, and will be very high when the laser is 
running on a single transverse mode. 

The temporal coherence of light refers to the time duration over which the phase is constant. In 
general, the temporal coherence time is determined by the spectral line width Au according to: 

t.^±. (10.8) 



Hence the coherence length Ic is given by: 



Ic = etc ~ ^ (10.9) 



Typical values of the coherence length for a number of light sources are given in Table 10.2. The figures 
explain why it is much easier to do interference experiments with a laser than with a discharge lamp. If 
the path difference exceeds Ic you will not get interference fringes, because the light is incoherent. In the 
case of the single-mode HeNe laser, you can set up an interferometer in which the path lengths differ by 
300 m, and you will still observe fringes. The long coherence length of laser light is useful in holography 
and interferometry. 



10.4 Examples of lasers 

There are many different types of lasers in common use, and it is not possible to describe all of them 
here. Most lasers operate at fixed wavelengths: 

Infrared lasers CO2 (10.6 |^m), erbium (1.55 (xm), Nd:YAG (1.064 ^m), Nd:glass (1.054 (xm); 

Visible lasers ruby (693 nm), krypton ion (676, 647 nm), HeNe (633 nm), copper vapour (578nm), 
doubled Nd:YAG (532 nm), argon ion (514, 488 nm), HeCd (442 nm); 

Ultraviolet lasers argon ion (364, 351 nm), tripled Nd:YAG (355 nm), nitrogen (337 nm), HeCd (325 nm), 
quadrupled Nd:YAG (266 nm), cxcimcr (308, 248, 193, 150 nm). 

Others lasers are tuneable, for example: dye lasers (typical tuning range ~ 100 nm, dyes available from UV 
to near infrared); Ti:sapphire lasers (700-1000 nm, doubled: 350-500nm); free electron lasers (far infrared 

to ultraviolet). The most common lasers in widespread use arc semiconductor diode lasers. Cheap 
and efficient diode lasers available at blue (400 nm), red (620-670 nm), and near-infrared wavelengths 
(700-1600 nm). 

In the sections below we consider a few of the more important lasers that are available, following the 
general classification according to whether the gain medium is a gas or a solid. ^ 

^There are relatively few liquid-phase lasers. The most important examples axe dye lasers. However, with the advent of 
broadly-tuneable high power solid-state lasers such as Ti:sapphire lasers, and the development of techniques of nonlinear 
optics to extend their frequency range (see Appendix 10.8), dye lasers are gradually becoming obsolete. 
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Figure 10.5: (a) Schematic diagram of a HeNe laser, (b) Level scheme for the HeNe laser. 



10.5 Gas lasers 

10.5.1 The helium-neon (HeNe) laser 

Helium-neon lasers consist of a discharge tube inserted between highly reflecting mirrors, as shown 
schematically in Fig. 10.5(a). The tube contains a mixture of helium and neon atoms in the approx- 
imate ratio of He:Ne 5:1. By applying a high voltage across the tube, an electrical discharge can be 
induced. The electrons collide with the atoms and put them into excited states. The light is emitted by 
the neon atoms, and the purpose of the helium is to assist the population inversion process. To see how 
this works we need to refer to the level diagram in Fig. 10.5(b). 

Helium has two electrons. In the ground state both electrons are in the Is level. The first excited state 
is the ls2s configuration. There are two possible energies for this state because there are two possible 
configurations of the electron spin: the singlet S' = and the triplet 5 = 1 terms. The helium atoms are 
excited by collisions with the electrons in the discharge tube and cascade down the levels. When they get 
to the ls2s configuration, however, the cascade process slows right down. In the 2sls — > Is^ transition 
one of the electrons jumps from the 2s level to the Is level. This is forbidden by the — ±1 selection 
rule. Furthermore, transitions from the ls2s 8=1 level to the Is^ S = ground state are also forbidden 
by the AS* = selection rule. The net result is that all transitions from the ls2s levels are strongly 
forbidden. The ls2s level therefore has a very long lifetime, and is called metastable. See Section 8.5 
in Chapter 8 for more details. 

Neon has ten electrons in the configuration ls^2s^2p^. The excited states correspond to the promotion 
of one of the 2p electrons to higher levels. This gives the level scheme shown in the diagram. The symbols 
of the excited states refer to the level of this single excited electron. By good luck, the 5s and 4s levels 
of the neon atoms are almost degenerate with the S' = and S = 1 terms of the ls2s configuration of 
helium. Thus the helium atoms can easily de-excite by collisions with neon atoms in the ground state 
according to the following scheme: 

He* +Ne^ Hc + Ne* . (10.10) 

The star indicates that the atom is an excited state. Any small differences in the energy between the 
excited states of the two atoms are taken up as kinetic energy. 

This scheme leads to a large population of neon atoms in the 5s and 4s excited states. This gives 
population inversion with respect to the 3p and 4p levels. It would not be easy to get this population 
inversion without the helium because collisions between the neon atoms and the electrons in the tube 
would tend to excite all the levels of the neon atoms equally. This is why there is more helium than neon 
in the tube. 

The main laser transition at 632.8 nm occurs between the 5s level and the 3p level. The lifetime of 
the 5s level is 170 ns, while that of the 3p level is 10 ns. This transition therefore easily satisfies the 
criterion r"PP"' > r'°^°''. (See discussion of eqn 9.28.) This ensures that atoms do not pile up in the 
lower level once they have emitted the laser photons, as this would destroy the population inversion. The 
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atoms in the 3p level rapidly relax to the ground state by radiative transitions to the 3s level and then 
by coUisional de-excitation to the original 2p level. Lasing can also be obtained on other transitions: for 
example, 5s 4p at 3391 nm and 4s — >■ 3p at 1152 nm. These are not as strong as the main 632.8 nm 
line. 

The gain in a HeNe tube tends to be rather low because of the relatively low density of atoms in 
the gas (compared to a solid). This is partly compensated by the fairly short lifetime of 170ns. (See 
eqn 9.24.) The round trip gain may only be a few percent, and so very highly reflecting mirrors are 
needed. With relatively small gain, the output powers are not very high - only a few mW. However, the 
ease of manufacture makes these lasers to be extremely common for low power applications: bar-code 
readers, laser alignment tools (theodolites, rifle sights), classroom demos etc. They arc gradually being 
replaced nowadays by visible semiconductor laser diodes, which are commonly used in laser pointers. 

10.5.2 Helium- cadmium lasers 

The HeCd laser is another gas laser system based on helium. The population inversion scheme in HeCd 
is similar to that in HeNe except that the active medium is Cd"*" ions. The laser transitions occur in the 
blue and the ultraviolet at 442 nm, 354 nm and 325 nm. The UV lines are useful for applications that 
require short wavelength lasers, such as high-precision printing on photosensitive materials. Examples 
include lithography of electronic circuitry and making master copies of compact disks. 

10.5.3 Ion lasers 

There are several important types of gas lasers that use ions rather than neutral atoms as the gain 
medium, for example, the argon-ion laser. The argon ions are produced by collisions with electrons 
in a discharge tube. The atomic number of argon is 18, and so the Ar"*" ion has 17 electrons in the 
configuration ls^2s^2p^3s^3p^. The excited states of the Ar+ ion are generated by exciting one of the 
five 3p electrons to higher levels, and the most important laser transitions occur between the 4p and 4s 
levels. Spin-orbit coupling splits this into a doublet, with emission lines at 488 nm (blue) and 514.5 nm 
(green). The krypton ion laser works by similar principles, and has a strong laser emission line in the 
red at 676.4 nm. This red line can be combined with the green and blue lines of the argon-ion laser to 
make very colourful laser light shows. 

In addition to laser light shows, argon- ion lasers are frequently used for pumping tuneable lasers such 
as dye lasers and Ti:sapphire lasers. There are also some medical applications such as laser surgery, and 
scientific applications include fluorescence excitation and Raman spectroscopy. 

10.5.4 Carbon dioxide lasers 

The CO2 laser is one of the best examples of a molecular laser. The transitions take place between the 
vibrational levels of the molecule. The strongest emission lines are in the infrared around 10.6 |xm. The 
lasers arc very powerful with powers up to several kilowatts possible. Hence they are used in cutting 
applications in industry (including the military industry!) and also for medical surgery. The high power 
output is a consequence of the fact that the stimulated emission becomes more favourable compared to 
spontaneous emission at lower frequencies: see eqn 9.14. 

A mixture of nitrogen and CO2 in a ratio of about 4:1 is used in the laser tube. The N2 molecules 
are excited by collisions with electrons, and then transfer their energy to the upper level of the CO2 
molecules. This gives population inversion in much the same sort of way as for the HeNe mixture. 

10.6 Solid-state lasers 
10.6.1 Ruby lasers 

Ruby lasers have historical importance because they were the first successful laser to operate. Ruby 
consists of Cr'^+ ions doped into crystalline AI2O3 (sapphire) at a typical concentration of around 0.05% 
by weight. The AI2O3 host crystal is colourless. The light is emitted by transitions of the Cr'^^ impurities. 

The level scheme for ruby is shown in Fig. 10.6(a). Ruby is a three-level system (see Section 9.8), 
with strong absorption bands in the blue and green spectral regions. (Hence the red colour: ruber means 
"red" in Latin.) Electrons are excited to these bands by a powerful flashlamp. These electrons relax 
rapidly to the upper laser level by non-radiative transitions in which phonons are emitted. This leads 
to a large population in the upper laser level. If the flashlamp is powerful enough, it will be possible to 
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Figure 10.6: (a) Level diagram for ruby (Cr'^+:Al203). (b) Schematic diagram of a ruby laser. 
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Figure 10.7: (a) Level diagram for the Nd'^^ lasers, (b) Schematic diagram of an Nd:YAG laser. 



pump more than half of the atoms from the ground state (level 1) to the upper laser level (level 2). In 
this case, there will then be population inversion between level 2 and level 1, and lasing can occur if a 
suitable cavity is provided. The laser emission is in the red at 694.3 nm. 

Figure 10.6(b) shows a typical arrangement for a ruby laser. The crystal is inserted inside a powerful 
helical flashlamp. Water-cooling prevents damage to the crystal by the intense heat generated by the 
lamp. Mirrors at either end of the crystal define the cavity. Reflective coatings can be applied directly to 
the end of the rod as shown, or external mirrors can be used (not shown). The lamps are usually driven 
in pulsed mode by discharge from a capacitor bank. The pulse energy can be as high as 100 J per pulse. 
This is because the upper laser level has a very long lifetime (3 ms) and can store a lot of energy. 



10.6.2 Neodymium lasers (Nd:YAG and Nd:glass) 

Neodymium ions form the basis for a series of high power solid-state lasers. In the two most common 
variants, the Nd"^"*" ions are doped into either Yttrium Aluminium Garnet (YAG) crystals or into a 
phosphate glass host. These two lasers are known as either Nd:YAG or Nd:glass. The main laser 
transition is in the near- infrared at about 1.06 |J.m. The wavelength does not change much on varying 
the host. 

Figure 10.7(a) shows the level scheme for the Nd'^+ lasers, which are four-level lasers. Electrons are 
excited to the pump bands by absorption of photons from a powerful flashlamp or from a diode laser 
operating around 800 nm. The electrons rapidly relax to the upper laser level by phonon emission. 
Lasing then occurs on the ''^F3/2 I11/2 transition.^ The electrons return to the ground state by rapid 
non-radiative decay by phonon emission. 

®This transition is strongly forbidden for free atoms. However, the wave functions of the Nd'^+ ion get distorted in 
the crystal by the electric fields from the neighbouring host atoms, and this relaxes the selection rules. The Einstein A 
coefficient of the f064nm transition in Nd:YAG is 4.3 X 10'' s^^. 
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Figure 10.8: Level diagram for the Ti:sapphire laser. 



Figure 10.7(b) shows the cavity arrangement in a flashlamp-pumped system. The rod and lamp are 
positioned at the foci of an elliptical reflector. This ensures that most of the photons emitted by the lamp 
are incident on the rod to maximize the pumping efficiency. Mirrors at either end of the rod provide the 
optical cavity. The laser can either be operated in pulsed or continuous wave mode. 

As with the ruby laser, the lifetime of the upper laser level is long: 0.2-0.3 ms, depending on the host. 
This long lifetime, which is a consequence of the fact that the laser transition is El-forbidden, allows 
the storage of large amounts of energy. Continuous wave Nd:YAG lasers can easily give 20-30 W, while 
pulsed versions can give energies up to IJ in 10 ns. The pulse energies possible from Nd:glass lasers are 
even higher, although they can only operate at lower repetition rates. The Lawrence Livermore Lab in 
California uses Nd:glass lasers for fusion research. The pulse energy in these systems is ^ 10 kJ. With 
pulse durations in the 10 ns range, this gives peak powers of 10^^ W. 

Nd lasers are extensively used in industry for cutting applications, and in medicine for laser surgery. 
They are very rugged and can be used in extreme conditions (eg onboard military aircraft). Frequency- 
doubled Nd:YAC lasers (see Appendix 10.8) are now gradually replacing argon-ion lasers for pumping 
tuneable lasers such as Ti:sapphire. (See below) 

10.6.3 Ti:sapphire 

Titanium-doped sapphire lasers represent the current state-of-the-art in tuneable lasers. The level scheme 
is shown in Fig. 10.8. The active transitions occur in the Ti'^^ ion. This has one electron in the 3d shell. 
In the octahedral environment of the sapphire (AI2O3) host, the crystal field splits the five m levels of 
the 3d shell into a doublet and a triplet. These are labelled as the and ^T2 states in Fig. 10.8.^ The 
electron-phonon coupling in Ti:sapphire is very strong, and the ^E and ^T2 states are broadened into 
"vibronic" bands. The absorption of the ^E band peaks in the green-blue spectral region, and thus can 
be pumped by the 488 nm and 514 nm lines of an argon ion laser. Alternatively, a frequency-doubled 
Nd:YAG laser operating at (1064/2 = 532 nm) can be used. 

Electrons excited into the middle of the ^E band rapidly relax by phonon emission to the bottom of 
the band. Laser emission can then take place to anywhere in the ^T2 band. The electrons finally relax 
to the bottom of the ^T2 band by rapid phonon emission. 

The fact that tuning can be obtained over the entire ^T2 band is a very useful feature because it 
means that the laser wavelength can be chosen at will. Lasing has in fact been demonstrated all the way 
from 690 nm to 1080 nm i.e. over nearly 400 nm. This is why it makes sense to use one laser to pump 
another: we convert a fixed frequency laser such as the argon-ion or frequency-doubled Nd:YAG into a 
tuneable source. Energy conversion efRciencies of up to 25% are possible. 

The broad emission band width is also ideal for making short pulse mode-locked lasers. (See Sec- 
tion 10.2.2). The shortest pulses that can be produced are given by AtAiy > l/27r. With such a broad 
emission band, it has been possible to generate pulses shorter than 10 fs. 

10.6.4 Semiconductor diode lasers 

Semiconductor diode lasers are by far the most common types of lasers. They are used in laser printers, 
DVD players, laser pointers, and optical fibre communication systems. The laser consists of a semicon- 

'^This notation might be familiar to the Chemical Physieists. The letters are abbreviations for German words. "E" and 
"T" label doublet and triplet states. The superscript of 2 refers to the spin degeneracy. Thus these two states contain 
(2x2-1-2x3 = 10) levels, as we would expect for the 3d states. The subscript of 2 on the triplet state indicates that it has 
a particular symmetry. 
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Figure 10.9: (a) Schematic diagram of the operation of a semiconductor diode laser, (b) Detailed sketch 
of a typical GaAs diode laser chip. 



ductor p-n diode cleaved into a small chip, as shown in Fig. 10.9(a). Electrons are injected into the 
n-region, and holes into the p-region. At the junction between the n- and p-regions we have both elec- 
trons in the conduction band and holes (i.e. empty states) in the valence band. This creates population 
inversion between the conduction and valence bands, and gain is produced at the band gap energy Eg of 
the semiconductor. The electrons in the conduction band drop to the empty states in the valence band, 
and laser photons with energy hv = Eg arc emitted. The drive voltage must be at least equal to Eg/e, 
where e is the electron charge. 

The laser cavity is formed by using the cleaved facets of the chips. The refractive index of a typical 
semiconductor is in the range 3-4, which gives about 30% reflectivity at each facet. This is enough to 
support lasing, even in crystals as short as ~ 1 mm, because the gain in the semiconductor crystal is very 
high. A highly reflective coating is often placed on the rear facet to prevent unwanted losses through this 
facet and hence reduce the threshold. 

The semiconductor must have a direct band gap to be an efficient light emitter. Silicon has an indirect 
band gap, and is therefore not used for laser diode applications. The laser diode industry is based mainly 
on the compound semiconductor GaAs, which has a direct band gap at 1.4 eV (890 nm). A typical design 
of a GaAs diode laser is shown in Fig. 10.9(b). By using alloys of GaAs, the band gap can be shifted into 
the red spectral region for making laser pointers, or further into the infrared to match the wavelength 
for lowest losses in optical fibres (1500 nm). Blue laser diodes are made from the wide band gap III-V 
semiconductor GaN and its alloys. These lasers are used in "blue-ray" systems. 

The power conversion efficiency of electricity into light in a diode laser is very high, with figures of 
25% typically achieved. This compares with typical efficiencies of < 0.1% in gas lasers. Since the laser 
chips are so small, it is possible to make high power diode lasers by running many GaAs chips in parallel. 
Laser power outputs over 20W can easily be achieved in this way. These high power laser diodes are very 
useful for pumping Nd:YAG lasers. 



10.7 Appendix: mathematics of mode-locking 

The electric field of the light emitted by a multi-mode laser is given by: 

£{t) ^^£mexp{iuJrnt + 'Pm) , (10.11) 
m 

where the sum is over all the longitudinal modes that are oscillating. w„ is the angular frequency of the 
mth mode {— rmrc/L, for n = 1), and ifm is its optical phase. In multi-mode operation all the phases 
of the modes are random, and not much can be done with the summation. However, in a mode-locked 
laser, all the phases are the same (call it ipo ) because they have been locked together. This allows us to 
evaluate the summation. 

We assume that all the modes have approximately equal amplitudes Sq. The output field of the 
mode-locked laser is then given by: 

£(t) =£oe''^''^e'"'"*. (10.12) 

m 

Let us suppose that there are A'' modes oscillating, and the frequency of the middle mode is cjq- This 
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Figure 10.10: (a) and (b): frequency doubling, (c) Sum frequency mixing, (d) Phase matching, (e) 
Parametric down conversion. The subscripts p, s, and i stand, respectively, for "pump", "signal" and 
"idler". 



gives the field as 



m' = + (Af-l)/2 

£{t) = Soe''^" exp (i(a;o + m'Tic/L)t) 

m' = -(JV-l)/2 

m'=+(7V-l)/2 , , 

= £:oe'^°e-°* expfi^i 

m'=-(N-l)/2 ^ 



(10.13) 



This type of summation is frequently found in the theory of diffraction gratings. It can be evaluated by 
standard techniques.® We are actually interested in the time dependence of the output power, which is 
given by: 

P{t) £{t)£{t)* . (10.14) 



The final answer is: 



P{t) cx 



sin2(Ar7rc</2L) 
sin2(7rct/2L) 



(10.15) 



This function has big peaks whenever t = integer x 2L/c and is small at all other times. Thus the output 
consists of pulses separated in time by 2L/c. The duration of the pulse is approximately given by the time 
for the numerator to go to zero after one of the major peaks. This time is 2L/Nc. The frequency band 
width Aj/ is equal to the (number of modes oscillating) x (spacing between modes), i.e. /S.v ^ N x c/2L. 
Thus AtAv ~ 1 as expected from the uncertainty principle given in eqn 10.7. 



10.8 Appendix: frequency conversion by nonlinear optics 

It was discovered very early on in the history of lasers that certain crystals could double the frequency 
of laser light, as shown in Fig. 10.10(a). This effect, which is know as frequency doubling, works 
by combining two photons at frequency uj to produce a single photon at frequency 2a;, as shown in 
Fig. 10.10(b). It occurs when a nonlinear crystal is driven by the intense light field produced by 
a powerful laser. The crystal must be non-centro-symmetric: i.e., belong to a crystal class that does 
not have inversion symmetry. Beta-barium borate (BBO), potassium dihydrogen phosphate (KDP) and 
lithium niobate are well-known examples of such crystals. 

Frequency doubling is a specific example of a more general effect called nonlinear frequency mixing. 
In nonlinear frequency mixing, the nonlinear crystal is driven by two intense waves at angular frequencies 
wi and UJ2, and a third wave is generated at the sum of their frequencies, as shown in Fig 10.10(c): 

UJ3 — oJi + UJ2 ■ (10.16) 

^Remember that e"'' = (e")'', and that Yl]Zo = (1 " f")/(l - r). 
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Frequency doubling eorresponds to the case where loi = lj2- The condition in eqn 10.16 is equivalent to 
energy conservation in the photon conversion process. The photon momentum must also be conserved, 
as shown in Fig. 10.10(d), which requires that: 



where k^, ki and k2 are the respective wave vectors inside the crystal. This condition is called phase 
matching, and is, in general, very hard to satisfy. Nonlinear frequency mixing therefore only works 
efficiently for the very specific wavelengths that satisfy the phase-matching condition. These wavelengths 
are selected by the orientation of the crystal. 

The nonlinear process works equally well in reverse, and a single "pump" photon can be split into 
two photons of lower frequency called the "signal" and "idler" photons subject to energy conservation: 



This process is called parametric down conversion and is illustrated in Fig. 10.10(e). Note that there 
is an infinite number of combinations of signal and idler frequencies that can satisfy eqn 10.18, and the 

actual frequencies that are produced are determined by the phase-matching condition. Down conversion 
is a convenient way to generate tuneable radiation from a fixed-frequency laser, and is now widely used 
to extend the range of frequencies available from lasers. 

It should be pointed out that the frequency conversion processes that are considered in this appendix 
are examples of phenomena that are well-known in classical nonlinear optics. The description in terms of 
photons is helpful, but not necessary: all of the effects can have classical explanations. Quantum effects 
do show up when these nonlinear processes are considered at the single-photon level, but this is not the 
regime that is being considered when the driving field is an intense laser beam. 
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^3 = ^1+^2, 



(10.17) 



Ulp=U>s+U!i. 



(10.18) 



